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ANTI-CANCER NUCLEIC ACID AND PROTEIN TARGETS 

CROSS-REFERENCE TO RELATED APPLICATION 

The present application claims priority to U.S. Patent Application 
No. 60/161,232, filed October 22, 1999, herein incorporated by reference in its 
entirety. 

FIELD OF THE INVENTION 

This invention relates to the discovery of nucleic acids and proteins 
associated with cancer. The identification of these cancer-associated nucleic acids 
and proteins have diagnostic uses in detecting the cancerous state of a cell 
population as well as applications for gene therapy and the prevention of malignant 
tumor development. 

BACKGROUND OF THE INVENTION 

Despite years of research into its causes and potential treatments, 
cancer remains the second leading cause of death in the United States. Currently, 
more than 500 thousand Americans die of cancer each year, more than 1500 people 
a day. In addition, even more cases will be detected: in 1999, more than 1.2 million 
new cancer cases will be diagnosed. Of the many types of cancer, epithelial cancers 
are among the most prevalent and deadly. For example, about 175,000 new invasive 
cases of breast cancer are expected to occur among women in the United States 
during 1999, and more than 43,000 women will die of this disease. Although some 
progress has been made towards understanding the causes of various types of cancer, 
a major need remains for new tools for the diagnosis and treatment of this disease. 

Cancer is a genetic disease of single cell origin caused by the 

accumulation of inherited and acquired mutations in specific cancer-associated 

genes, which have normal cellular functions, but when mutated or present at 

abnormally high levels contribute to cancer. Such cancer-associated genes can be 

1 
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classified in two categories: genes that cause cancer when they become inactivated 
and genes that lead to cancer when they are mutated or overexpressed (Alberts et al 
(1994) Molecular Biology of the Cell, Third Edition. Garland Publishing, New- 
York, pp: 1255-1291). The first group of genes are often referred to as tumor- 
5 suppressor genes and include, e.g., the retinoblastoma gene, p53, etc. (Holden et al 
(1999) Med. Hypotheses 52:483-485; McCormick, F. (1999) Cancer J. Sci. Am. 
5:139-144; Prives etal, (1999) J. Pathol 187:112-126; Vooijs et al. (1999) 
Oncogene 18:5293-5303; Stiegler a /. (1998)/. Cell Biochem. Suppl. 30:30-36). 
They are required for the normal functioning of a cell, for example by restricting 

10 cellular proliferation or growth, or by controlling cell adhesion. Their inactivation 
or loss of their function results in the loss of these controls or properties and 
contributes to tumor development (Dictor et al (1999) Am. J. Clin. Pathol 1 12:S40- 
52; Bartek et al. (1997) Exp. Cell Res. 237:1-6; Kaelin (1997) Ann. N. Y. Acad. Sci. 
833:29-33). The genes belonging to the second class are referred to as oncogenes 

15 and they include, e.g., ras, myc, src, etc. (Introna et al (1999) Leukemia 13: 1301- 
1306; Nesbit et al. (1999) Oncogene 18:3004-3016; Schwab (1999) Semin. Cancer 
Biol 9:319-125; Stice etal. (1999) Front Biosci. 15:D72-86; Porter etal (1998) 
Oncogene 17:1343-1352). A mutation or any other event resulting in their 
overexpression can also contribute to cancer development, for example, by releasing 

20 cells from normal restraints on proliferation. Although several genes have been 

found in each of these categories, the mechanism underlying cancer development is 
still largely unknown. In addition, it is well established that cancer is often the result 
of a number of mutations in different genes (Alberts et al (1994) Molecular Biology 
of the Cell, Third Edition. Garland Publishing, New- York, pp: 1255-1291; Devereux 

25 et al. (1999) IARC Sci. Publ. 146: 19-42; Fearon et al. (1999) Curr. Biol. 28:R62-65; 
Murakami et al. (1998) Mutat. Res. 400: 421-437). There is therefore a major need 
in the field for identifying both genes that control the normal functioning of a cell 
and prevent its transformation into a malignant cell and genes whose expression 
needs to be strictly controlled to prevent cancer development. This knowledge 

30 would provide a major tool for diagnosing and treating cancer. 

The present invention is based on the discovery that a number of 
polynucleotide sequences or proteins are overexpressed in many types of cancer 
cells in mammals, while other polynucleotide sequences or proteins are 
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underexpressed in cancerous cells, compared to normal healthy tissues. As 
described herein, this discovery has provided novel and badly needed diagnostic, 
prognostic, and therapeutic tools for many types of cancers. 



5 SUMMARY OF THE INVENTION 

The present invention provides isolated nucleic acids and proteins 
associated with cancer. Such sequences can be used to determine the cancerous 
state of a cell population, e.g., whether cells are proliferating at an abnormally high 
rate, are undergoing transformation and/or are malignant. Moreover, the present 

10 invention provides sequences indicative of the proliferation state, transformation, 
malignancy or metastatic ability of a cell. The genes that exhibit cancer-associated 
alterations in expression can also be targeted and their level of expression altered by, 
for example, gene therapy methods (e.g., by altering the subject sequences). Such 
methods can be used, for example, to slow or stop the tumor development process, 

15 to arrest proliferation of a cell population, such as a tumor cell population, to inhibit 
transformation of a cell and to determine that a cell population is healthy. 

In one aspect, the present invention provides a method for diagnosing 
cancer of a tissue of interest in a patient, comprising: detecting the overexpression or 
the underexpression of a cancer-associated molecule in the tissue of interest in the 

20 patient, where overexpression or underexpression of such molecule is indicative of a 
cancer. In a preferred embodiment, the cancer-associated molecule is selected from 
the molecules listed in Table 1 . In one embodiment, underexpression of the cancer- 
associated molecule is indicative of cancer, and the cancer-associated molecule is 
underexpressed in the patient. In another embodiment, overexpression of the 

25 cancer- associated molecule is indicative of cancer, and the cancer- associated 

molecule is overexpressed in the patient. In a preferred embodiment, the cancer- 
associated molecule is an mRNA. In another preferred embodiment, the cancer- 
associated molecule is detected using an immunoassay. The tissue of interest is 
preferably selected from the group consisting of liver, breast, prostate, kidney and 

30 colon. 

In another aspect, the invention provides a method for arresting 
cancer, comprising introducing into a cell a cancer- associated molecule according to 
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Table 1, wherein underexpression of the cancer-associated molecule is indicative of 
cancer. The cancer-associated molecule of interest may be a nucleic acid encoding a 
cancer-associated protein or a protein. 

In still another aspect, the invention provides a method for arresting 
5 cancer comprising inhibiting a cancer-associated molecule according to Table 1, 

wherein overexpression of the cancer-associated molecule is indicative of cancer. In 
a preferred embodiment, the cancer-associated molecule is inhibited using an 
antisense polynucleotide. In another preferred embodiment, the cancer-associated 
molecule is inhibited using an antibody that specifically binds to said cancer- 
associated protein. 

The present invention also provides a method for identifying a 
modulator of cancer development, the method comprising: culturing a cell in the 
presence of the modulator to form a first cell culture; contacting RNA or cDNA 
from the first cell culture with a probe which comprises a polynucleotide sequence 
associated with cancer; determining whether the amount of the probe which 
hybridizes to the RNA or cDNA from the first cell culture is increased or decreased 
relative to the amount of the probe which hybridizes to RNA or cDNA from a 
second cell culture grown in the absence of the modulator, and further, detecting a 
phenotype indicative of an amelioration of the cancerous state of the cell population 
that is treated with the modulator. In a preferred embodiment, the probe comprises 
at least about 10 nucleotides from a polynucleotide sequence selected from the 
group consisting of the sequences set out in Table 1 which show increased 
expression in cancerous tissues. Amelioration of the cancerous state of a cell 
population may include, for example, a decrease in the proliferative potential, 
transformation, malignancy and/or metastatic ability of a cell. The first and second 
cell cultures may be obtained, for example, from a liver, a kidney, a breast, a colon 
or a prostate cell. 

In still another aspect, the present invention provides kits for carrying 
out the various methods. For instance, in one embodiment, a kit is provided for 
detecting whether a cell is undergoing transformation or becoming malignant, the kit 
comprising: a probe which comprises a polynucleotide sequence associated with 
cancer and a label for detecting the presence of the probe. In one embodiment, the 
probe comprises at least about 10 nucleotides from a polynucleotide sequence 

4 
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selected from the group consisting of the sequences listed in Table 1 . Additionally, 
this kit can further comprise a plurality of probes each of which comprises a 
polynucleotide sequence associated with cancer; and a label or labels for detecting 
the presence of the plurality of probes. The probes can optionally be immobilized 
5 on a solid support (e.g., a chip). 

The invention also embraces the use of antisense methods for 
examining cancer in animals and cells. Typically, any time a gene is identified, it 
can be examined by knocking out the gene in an animal and observing the effect on 
the animal phenotype. Knockouts can be achieved by transposons which insert by 

10 homologous recombinations, antisense or ribozymes specifically directed at 

disturbing the embryonic stem cells of an organism such as a mouse. Ribozymes 
can include any of the various types of ribozymes modified to cleave the mRNA 
encoding, for example, the cancer-associated protein. Examples include hairpins 
and hammerhead ribozymes. Finally, antisense molecules which selectively bind, 

15 for example, to the mRNA encoding a cancer-associated protein are expressed via 
expression cassettes operably linked to subsequences of the cancer-associated gene 
and generally comprise 20-50 base long sequences in opposite orientation to the 
mRNA to which they are targeted. 

The invention also provides a method for modulating cancer 

20 development in a patient in need thereof, the method comprising administering to 
the patient a compound that modulates cancer development. In a preferred 
embodiment, the modulator increases or decreases the expression of a nucleic acid 
sequence associated with cancerous transformation of a cell, e.g., a liver, kidney, 
breast, colon or prostate cell, set out in Table 1 . In this embodiment, the nucleic 

25 acid can, for example, comprise at least about 10 nucleotides from a polynucleotide 
sequence selected from the group consisting of the sequences set out in Table 1 . 



DEFINITIONS 

30 "Cancer" or "malignancy" are used as synonymous terms and refer to 

any of a number of diseases that are characterized by uncontrolled, abnormal 
proliferation of cells, the ability of affected cells to spread locally or through the 

5 
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bloodstream and lymphatic system to other parts of the body (i.e. metastasize) as 
well as any of a number of characteristic structural and/or molecular features. A 
"cancerous" or "malignant cell" is understood as a cell having specific structural 
properties, lacking differentiation and being capable of invasion and metastasis. 
5 Examples of cancers are kidney, colon, breast, prostate and liver cancer. 

"Neoplasia" or "tumor" describe new abnormal growth which can be 
benign or malignant. "Neoplastic proliferation" describes cell proliferation that 
persists even in the absence of growth stimulus. 

In the context of the invention, the term "transformation" refers to the 
10 change that a normal cell undergoes as it becomes malignant. In eukaryotes, the 
term "transformation" can be used to describe the conversion of normal cells to 
malignant cells in cell culture. 

"Proliferating cells" are those which are actively undergoing cell 
division and growing exponentially. "Loss of cell proliferation control" refers to the 
15 property of cells that have lost the cell cycle controls that normally ensure 

appropriate restriction of cell division. Cells that have lost such controls proliferate 
at a faster than normal rate, without stimulatory signals, and do not respond to 
inhibitory signals. 

"Amplification" primers are oligonucleotides comprising either 
20 natural or analog nucleotides that can serve as the basis for the amplification of a 
select nucleic acid sequence. They include, for example, both polymerase chain 
reaction primers and ligase chain reaction oligonucleotides. 

"Antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof which 
25 specifically bind and recognize an analyte (antigen). The recognized 

immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and 
mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are 
classified as gamma, mu, alpha, delta, or epsilon, which in turn define the 
30 immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, 
each pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). 

6 
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The N-terminus of each chain defines a variable region of about 100 to 1 10 or more 
amino acids primarily responsible for antigen recognition. The terms variable light 
chain (V L ) and variable heavy chain (V H ) refer to these light and heavy chains 
respectively. 

5 Antibodies exist, e.g., as intact immunoglobulins or as a number of 

well characterized fragments produced by digestion with various peptidases. Thus, 
for example, pepsin digests an antibody below the disulfide linkages in the hinge 
region to produce F(ab)' 2 , a dimer of Fab which itself is a light chain joined to V H - 
ChI by a disulfide bond. The F(ab) f 2 may be reduced under mild conditions to break 

10 the disulfide linkage in the hinge region, thereby converting the F(ab) f 2 dimer into an 
Fab' monomer. The Fab 1 monomer is essentially an Fab with part of the hinge 
region {see, Fundamental Immunology, Third Edition, W.E. Paul, ed., Raven Press, 
N.Y. 1993). While various antibody fragments are defined in terms of the digestion 
of an intact antibody, one of skill will appreciate that such fragments may be 

1 5 synthesized de novo either chemically or by utilizing recombinant DNA 

methodology. Thus, the term antibody, as used herein, also includes antibody 
fragments either produced by the modification of whole antibodies or those 
synthesized de novo using recombinant DNA methodologies {e.g., single chain Fv). 

"Cancer-associated" refers to the relationship of a nucleic acids and 

20 its expression, or lack thereof, or a protein and its level or activity, or lack thereof, to 
the onset of malignancy in a subject cell. For example, cancer can be associated 
with expression of a particular gene that is not expressed, or is expressed at a lower 
level, in a normal healthy cell. Conversely, a cancer-associated gene can be one that 
is not expressed in a malignant cell (or in a cell undergoing transformation), or is 

25 expressed at a lower level in the malignant cell than it is expressed in a normal 
healthy cell. 

"Biological samples" refers to any tissue or liquid sample having 
genomic DNA or other nucleic acids {e.g., mRNA) or proteins. It refers to samples 
of cells with a normal complement of chromosomes as well as samples of cells 
30 suspected of malignancy. 

The term "gene" means the segment of DNA involved in producing a 
polypeptide chain; it includes regions preceding and following the coding region 



7 
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(leader and trailer) as well as intervening sequences (introns) between individual 
coding segments (exons). 

The term "isolated," when applied to a nucleic acid or protein, 
denotes that the nucleic acid or protein is essentially free of other cellular 
5 components with which it is associated in the natural state. It is preferably in a 

homogeneous state although it can be in either a dry or aqueous solution. Purity and 
homogeneity are typically determined using analytical chemistry techniques such as 
polyacrylamide gel electrophoresis or high performance liquid chromatography. A 
protein which is the predominant species present in a preparation is substantially 

10 purified. In particular, an isolated gene is separated from open reading frames which 
flank the gene and encode a protein other than the gene of interest. The term 
"purified" denotes that a nucleic acid or protein gives rise to essentially one band in 
an electrophoretic gel. Particularly, it means that the nucleic acid or protein is at 
least 85% pure, more preferably at least 95% pure, and most preferably at least 99% 

15 pure. 

The term "nucleic acid" refers to deoxyribonucleotides or 
ribonucleotides and polymers thereof in either single- or double-stranded form. 
Unless specifically limited, the term encompasses nucleic acids containing known 
analogues of natural nucleotides which have similar binding properties as the 

20 reference nucleic acid and are metabolized in a manner similar to naturally occurring 
nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also 
implicitly encompasses conservatively modified variants thereof (e.g., degenerate 
codon substitutions) and complementary sequences and as well as the sequence 
explicitly indicated. Specifically, degenerate codon substitutions may be achieved 

25 by generating sequences in which the third position of one or more selected (or all) 
codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et aL, 
Nucleic Acid Res. 79:5081 (1991); Ohtsuka et aL, J. Biol. Chem. 260:2605-2608 
(1985); and Cassol et aL, 1992; Rossolini et aL, MoL Cell. Probes 5:91-98 (1994)). 
The term nucleic acid is used interchangeably with gene, cDNA, and mRNA 

30 encoded by a gene. 

As used herein a "nucleic acid probe" is defined as a nucleic acid 
capable of binding to a target nucleic acid {e.g., a nucleic acid associated with 
cancer) of complementary sequence through one or more types of chemical bonds, 
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usually through complementary base pairing, usually through hydrogen bond 
formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or 
modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not 

5 interfere with hybridization. Thus, for example, probes may be peptide nucleic acids 
in which the constituent bases are joined by peptide bonds rather than 
phosphodiester linkages. It will be understood by one of skill in the art that probes 
may bind target sequences lacking complete complementarity with the probe 
sequence depending upon the stringency of the hybridization conditions. 

0 Nucleic acid probes can be DNA or RNA fragments. DNA 

fragments can be prepared, for example, by digesting plasmid DNA, or by use of 
PCR, or synthesized by either the phosphoramidite method described by Beaucage 
and Carruthers, Tetrahedron Lett. 22:1859-1862 (1981) (Beaucage and Carruthers), 
or by the triester method according to Matteucci, et aL 9 J. Am. Chem. Soc, 103:3185 

5 (1981) (Matteucci), both incorporated herein by reference. A double stranded 

fragment may then be obtained, if desired, by annealing the chemically synthesized 
single strands together under appropriate conditions, or by synthesizing the 
complementary strand using DNA polymerase with an appropriate primer sequence. 
Where a specific sequence for a nucleic acid probe is given, it is understood that the 

0 complementary strand is also identified and included. The complementary strand 
will work equally well in situations where the target is a double-stranded nucleic 
acid. 

A "labeled nucleic acid probe" is a nucleic acid probe that is bound, 
either covalently, through a linker, or through ionic, van der Waals or hydrogen 

5 bonds to a label such that the presence of the probe may be detected by detecting the 
presence of the label bound to the probe. 

The phrase "a nucleic acid sequence encoding" refers to a nucleic 
acid which contains sequence information for a structural RNA such as rRNA, a 
tRNA, or the primary amino acid sequence of a specific protein or peptide, or a 

0 binding site for a trans-acting regulatory agent. This phrase specifically 

encompasses degenerate codons (i.e., different codons which encode a single amino 
acid) of the native sequence or sequences which may be introduced to conform with 
codon preference in a specific host cell. 

9 
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"Stringent hybridization conditions" and "stringent hybridization 
wash conditions" in the context of nucleic acid hybridization experiments, such as 
Southern and northern hybridizations, are sequence dependent, and are different 
under different environmental parameters. Longer sequences hybridize specifically 

5 at higher temperatures. An extensive guide to the hybridization of nucleic acids is 
found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular 
Biology—Hybridization with Nucleic Acid Probes, part I, chapter 2 "Overview of 
principles of hybridization and the strategy of nucleic acid probe assays," Elsevier, 
NY. Generally, highly stringent hybridization and wash conditions are selected to 

0 be about 5° C lower than the thermal melting point (T m ) for the specific sequence at 
a defined ionic strength and pH. Typically, under "stringent conditions," a probe will 
hybridize to its target subsequence, but to no other sequences. 

The T m is the temperature (under defined ionic strength and pH) at 
which 50% of the target sequence hybridizes to a perfectly matched probe. Very 

5 stringent conditions are selected to be equal to the T m for a particular probe. An 
example of stringent hybridization conditions for hybridization of complementary 
nucleic acids which have more than 100 complementary residues on a filter in a 
Southern or northern blot is 50% formamide with 1 mg of heparin at 42°C, with the 
hybridization being carried out overnight. An example of highly stringent wash 

0 conditions is 0.1 5M NaCl at 72°C for about 15 minutes. An example of stringent 
wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see f Sambrook, supra, 
for a description of SSC buffer). Often, a high stringency wash is preceded by a low 
stringency wash to remove background probe signal. An example medium 
stringency wash for a duplex of, e.g., more than 100 nucleotides, is lx SSC at 45°C 

5 for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 
100 nucleotides, is 4-6x SSC at 40°C for 15 minutes. For short probes (e.g., about 
10 to 50 nucleotides), stringent conditions typically involve salt concentrations of 
less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or 
other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30°C. 

0 Stringent conditions can also be achieved with the addition of destabilizing agents 
such as formamide. In general, a signal to noise ratio of 2x (or higher) than that 
observed for an unrelated probe in the particular hybridization assay indicates 
detection of a specific hybridization. Nucleic acids which do not hybridize to each 

10 
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other under stringent conditions are still substantially identical if the polypeptides 
which they encode are substantially identical. This occurs, e.g., when a copy of a 
nucleic acid is created using the maximum codon degeneracy permitted by the 
genetic code. 

5 The phrase "specifically (or selectively) binds to an antibody" or 

"specifically (or selectively) immunoreactive with", when referring to a protein or 
peptide, refers to a binding reaction which is determinative of the presence of the 
protein in the presence of a heterogeneous population of proteins and other 
biologies. Thus, under designated immunoassay conditions, the specified antibodies 

0 bind to a particular protein and do not bind in a significant amount to other proteins 
present in the sample. Specific binding to an antibody under such conditions may 
require an antibody that is selected for its specificity for a particular protein. For 
example, antibodies raised to the protein with the amino acid sequence encoded by 
any of the polynucleotides of the invention can be selected to obtain antibodies 

5 specifically immunoreactive with that protein and not with other proteins except for 
polymorphic variants. A variety of immunoassay formats may be used to select 
antibodies specifically immunoreactive with a particular protein. For example, 
solid-phase ELISA immunoassays, Western blots, or immunohistochemistry are 
routinely used to select monoclonal antibodies specifically immunoreactive with a 

0 protein. See, Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold 
Spring Harbor Publications, New York ("Harlow and Lane") for a description of 
immunoassay formats and conditions that can be used to determine specific 
immunoreactivity. Typically, a specific or selective reaction will be at least twice 
background signal or noise and more typically more than 10 to 100 times 

5 background. 



DETAILED DESCRIPTION AND 
PREFERRED EMBODIMENTS OF THE INVENTION 

The present invention provides nucleic acids and proteins that are 
indicative of cancer. Host cells, vectors, and probes are described, as are antibodies 
to the proteins and uses of the proteins as antigens. The present invention provides 
methods for obtaining and expressing nucleic acids, methods for purifying gene 

11 
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products, other methods that can be used to detect and quantify the expression and 
quality of the gene product (e.g., proteins), and uses for both the nucleic acids and 
the gene products. 

5 This invention relies on routine techniques in the field of recombinant 

genetics. A basic text disclosing the general methods of use in this invention is 
Sambrook et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Publish., Cold Spring Harbor, NY 2nd ed. (1989) and Kriegler, Gene Transfer and 
Expression: A Laboratory Manual, W.H. Freeman, N.Y., (1990), which are both 

3 incorporated herein by reference. Unless otherwise stated all enzymes are used in 
accordance with the manufacturer's instructions. 

I. Detection of Gene Expression and Genomic Analysis of Cancer- 
Associated Proteins. 

The polynucleotides and polypeptides of the present invention can be 
5 employed as research reagents and materials for discovery of treatments and 

diagnostics to human disease. It will be readily apparent to those of skill in the art 
that although the following discussion is directed to methods for detecting nucleic 
acids associated with cancer, similar methods can be used to detect nucleic acids 
associated with, e.g., cell proliferation, cell transformation, neoplasia, metastasis, 
3 etc. 

As should be apparent to those of skill in the art, the invention is the 
identification of cancer-associated genes and the discovery that multiple nucleic 
acids are associated with cancer development, cell proliferation, cell transformation, 
neoplasia and/or metastasis. Accordingly, the present invention also includes 

5 methods for detecting the presence, alteration or absence of the such cancer- 
associated nucleic acids (e.g., DNA or RNA) in a physiological specimen in order to 
determine the level of proliferation and/or transformation of cells in vitro, or ex vivo, 
as well as the genotype and risk of cancer development associated with mutations 
created in cancer-associated sequences. Although any tissue having cells bearing 

3 the genome of an individual, or RNA associated with cancer, can be used, the most 
convenient specimen will be blood samples or biopsies of suspect tissue. It is also 
possible and preferred in some circumstances to conduct assays on cells that are 

12 
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isolated under microscopic visualization. A particularly useful method is the 
microdissection technique described in PCT Published Application No. WO 
95/23960. The cells isolated by microscopic visualization can be used in any of the 
assays described herein including both genomic and immunological based assays. 
5 This invention provides for methods of genotyping family members 

in which relatives are diagnosed with cancer. Conventional methods of genotyping 
are provided herein. 

The invention provides methods for detecting whether a cell is in a 
cancerous state or is undergoing transformation and/or becoming malignant. The 

10 methods typically comprise contacting RNA from the cell with a probe which 
comprises a polynucleotide sequence associated with cancer, and determining 
whether the amount of the probe which hybridizes to the RNA is increased or 
decreased relative to the amount of the probe which hybridizes to RNA from a non- 
cancerous cell. The assays are useful for detecting cell transformation associated 

1 5 with, for example, different types of cancer. One can also detect cell proliferation, 
neoplasia and/or transformation using the methods of the invention. 

The probes are capable of binding to a target nucleic acid (e.g., a 
nucleic acid associated with cancer development). By assaying for the presence or 
absence of the probe, one can detect the presence or absence of the target nucleic 

20 acid in a sample. Preferably, non-hybridizing probe and target nucleic acids are 
removed (e.g., by washing) prior to detecting the presence of the probe. 

A variety of methods of specific DNA and RNA measurement using 
nucleic acid hybridization techniques are known to those of skill in the art. See, 
Sambrook, supra. For example, one method for evaluating the presence or absence 

25 of the DNA in a sample involves a Southern transfer. Briefly, the digested genomic 
DNA is run on agarose slab gels in buffer and transferred to membranes. 
Hybridization is carried out using the probes discussed above. Visualization of the 
hybridized portions allows the qualitative determination of the presence, alteration 
or absence of a cancer-associated gene. 

30 Similarly, a Northern transfer may be used for the detection of 

cancer-associated mRNA in samples of RNA from cells expressing the cancer- 
associated proteins. In brief, the mRNA is isolated from a given cell sample using 
an acid guanidinium-phenol-chloroform extraction method. The mRNA is then 
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electrophoresed to separate the mRNA species and the mRNA is transferred from 
the gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are 
used to identify the presence or absence of the subject protein transcript. 
Alternatively, the amount of, for example, a cancer-associated mRNA can be 

5 analyzed in the absence of electrophoretic separation. 

The selection of a nucleic acid hybridization format is not critical. A 
variety of nucleic acid hybridization formats are known to those skilled in the art. 
For example, common formats include sandwich assays and competition or 
displacement assays. Hybridization techniques are generally described in "Nucleic 

0 Acid Hybridization, A Practical Approach" Ed. Hames, B.D. and Higgins, S.J., IRL 
Press, 1985; Gall and Pardue (1969), Proc. Natl Acad. ScL, U.S.A., 63:378-383; 
and John, Burnsteil and Jones (1969) Nature, 223:582-587. 

For example, sandwich assays are commercially useful hybridization 
assays for detecting or isolating nucleic acids. Such assays utilize a "capture" 

5 nucleic acid covalently immobilized to a solid support and labeled "signal" nucleic 
acid in solution. The clinical sample will provide the target nucleic acid. The 
"capture" nucleic acid and "signal" nucleic acid probe hybridize with the target 
nucleic acid to form a "sandwich" hybridization complex. To be effective, the signal 
nucleic acid cannot hybridize with the capture nucleic acid. 

0 Detection of a hybridization complex may require the binding of a 

signal generating complex to a duplex of target and probe polynucleotides or nucleic 
acids. Typically, such binding occurs through ligand and anti-ligand interactions as 
between a ligand-conjugated probe and an anti-ligand conjugated with a signal. The 
binding of the signal generation complex is also readily amenable to accelerations 

5 by exposure to ultrasonic energy. 

The label may also allow indirect detection of the hybridization 
complex. For example, where the label is a hapten or antigen, the sample can be 
detected by using antibodies. In these systems, a signal is generated by attaching 
fluorescent or enzyme molecules to the antibodies or in some cases, by attachment 

0 to a radioactive label (see, e.g., Tijssen, P., "Practice and Theory of Enzyme 

Immunoassays," Laboratory Techniques in Biochemistry and Molecular Biology, 
Burdon, R.H., van Knippenberg, P.H., Eds., Elsevier (1985), pp. 9-20). 
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The probes are typically labeled directly, as with isotopes, 
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to 
which a streptavidin complex may later bind. Thus, the detectable labels used in the 
assays of the present invention can be primary labels (where the label comprises an 
5 element that is detected directly or that produces a directly detectable element) or 
secondary labels (where the detected label binds to a primary label, e.g., as is 
common in immunological labeling). Typically, labeled signal nucleic acids are used 
to detect hybridization. Complementary nucleic acids or signal nucleic acids may be 
labeled by any one of several methods typically used to detect the presence of 
1 0 hybridized polynucleotides. The most common method of detection is the use of 
autoradiography with 3 H, ,25 I, 35 S, l4 C, or 32 P-labeled probes or the like. 

Other labels include ligands which bind to labeled antibodies, 
fluorophores, chemi-luminescent agents, enzymes, and antibodies which can serve 
as specific binding pair members for a labeled ligand. An introduction to labels, 
15 labeling procedures and detection of labels is found in Polak and Van Noorden 

(1997) Introduction to Immunocytochemistry, 2nd ed., Springer Verlag, New York, 
and in Haugland (1996) Handbook of Fluorescent Probes and Research Chemicals, 
a combined handbook and catalogue Published by Molecular Probes, Inc., Eugene, 
OR. Primary and secondary labels can include undetected elements as well as 
20 detected elements. Useful primary and secondary labels in the present invention can 
include spectral labels such as fluorescent dyes (e.g., fluorescein and derivatives 
such as fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and 
derivatives (e.g., Texas red, tetrarhodimine isothiocynate (TRITC), etc.), 
digoxigenin, biotin, phycoerythrin, AMCA, CyDyes™, and the like), radiolabels 
(e.g., 3 H, 125 1, 35 S, 14 C, 32 P, 33 P, etc.), enzymes (e.g., horse radish peroxidase, 
alkaline phosphatase etc.), spectral colorimetric labels such as colloidal gold or 
colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. The 
label may be coupled directly or indirectly to a component of the detection assay 
(e.g., the probe) according to methods well known in the art. As indicated above, a 
wide variety of labels may be used, with the choice of label depending on sensitivity 
required, ease of conjugation with the compound, stability requirements, available 
instrumentation, and disposal provisions. 
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Preferred labels include those that use: 1) chemiluminescence (using 
horseradish peroxidase and/or alkaline phosphatase with substrates that produce 
photons as breakdown products as described above) with kits being available, e.g., 
from Molecular Probes, Amersham, Boehringer-Mannheim, and Life Technologies/ 
5 Gibco BRL; 2) color production (using both horseradish peroxidase and/or alkaline 
phosphatase with substrates that produce a colored precipitate [kits available from 
Life Technologies/Gibco BRL, and Boehringer-Mannheim]); 3) hemifluorescence 
using, e.g., alkaline phosphatase and the substrate AttoPhos [Amersham] or other 
substrates that produce fluorescent products, 4) fluorescence (e.g., using Cy-5 

10 [Amersham]), fluorescein, and other fluorescent tags]; and 5) radioactivity. Other 
methods for labeling and detection will be readily apparent to one skilled in the art. 

Preferred enzymes that can be conjugated to detection reagents of the 
invention include, e.g., p-galactosidase, luciferase, horse radish peroxidase, and 
alkaline phosphatase. The chemiluminescent substrate for luciferase is luciferin. 

15 One embodiment of a chemiluminescent substrate for p-galactosidase is 4- 
methylumbelliferyl-p-D-galactoside. Embodiments of alkaline phosphatase 
substrates include p-nitrophenyl phosphate (pNPP), which is detected with a 
spectrophotometer; 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium 
(BCIP/NBT) and fast red/napthol AS-TR phosphate, which are detected visually; 

20 and 4-methoxy-4-(3-phosphonophenyl) spiro[l,2-dioxetane-3,2 f -adamantane], which 
is detected with a luminometer. Embodiments of horse radish peroxidase substrates 
include 2,2 , azino-bis(3 -ethylbenzthiazoline-6 sulfonic acid) (ABTS), 5- 
aminosalicylic acid (5 AS), o-dianisidine, and o-phenylenediamine (OPD), which are 
detected with a spectrophotometer; and 3,3,5,5'-tetramethylbenzidine (TMB), 

25 3,3'diaminobenzidine (DAB), 3-amino-9-ethylcarbazole (AEC), and 4-chloro-l- 

naphthol (4C1N), which are detected visually. Other suitable substrates are known 
to those skilled in the art. The enzyme-substrate reaction and product detection are 
performed according to standard procedures known to those skilled in the art and 
kits for performing enzyme immunoassays are available as described above. 

30 In general, a detector which monitors a particular probe or probe 

combination is used to detect the detection reagent label. Typical detectors include 
spectrophotometers, phototubes and photodiodes, microscopes, scintillation 
counters, cameras, film and the like, as well as combinations thereof. Examples of 
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suitable detectors are widely available from a variety of commercial sources known 
to persons of skill in the art. Commonly, an optical image of a substrate comprising 
bound labeling moieties is digitized for subsequent computer analysis. 

Most typically, the amount of, for example, a cancer-associated RNA 
5 is measured by quantitating the amount of label fixed to the solid support by binding 
of the detection reagent. Typically, presence of a modulator during incubation will 
increase or decrease the amount of label fixed to the solid support relative to a 
control incubation which does not comprise the modulator, or as compared to a 
baseline established for a particular reaction type. Means of detecting and 

10 quantitating labels are well known to those of skill in the art. Thus, for example, 
where the label is a radioactive label, means for detection include a scintillation 
counter or photographic film as in autoradiography. Where the label is optically 
detectable, typical detectors include microscopes, cameras, phototubes and 
photodiodes and many other detection systems which are widely available. 

15 In preferred embodiments, the target nucleic acid or the probe is 

immobilized on a solid support. Solid supports suitable for use in the assays of the 
invention are known to those of skill in the art. As used herein, a solid support is a 
matrix of material in a substantially fixed arrangement. Exemplar solid supports 
include glasses, plastics, polymers, metals, metalloids, ceramics, organics, etc. Solid 

20 supports can be flat or planar, or can have substantially different conformations. For 
example, the substrate can exist as particles, beads, strands, precipitates, gels, sheets, 
tubing, spheres, containers, capillaries, pads, slices, films, plates, dipsticks, slides, 
etc. Magnetic beads or particles, such as magnetic latex beads and iron oxide 
particles, are examples of solid substrates that can be used in the methods of the 

25 invention. Magnetic particles are described in, for example, US Patent No. 
4,672,040, and are commercially available from, for example, PerSeptive 
Biosystems, Inc. (Framingham MA), Ciba Corning (Medfield MA), Bangs 
Laboratories (Carmel IN), and BioQuest, Inc. (Atkinson NH). The substrate is 
chosen to maximize signal to noise ratios, primarily to minimize background 

30 binding, for ease of washing and cost. 

A variety of automated solid-phase assay techniques are also 
appropriate. For instance, very large scale immobilized polymer arrays 
(VLSIPS™), available from Affymetrix, Inc. in Santa Clara, CA can be used to 
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detect changes in expression levels of a plurality of cancer-associated nucleic acids 
simultaneously. See, Tijssen, supra,, Fodor et al (1991) Science, 251 : 767- 777; 
Sheldon et al (1993) Clinical Chemistry 39(4): 718-719, and Kozal et al (1996) 
Nature Medicine 2(7): 753-759. Thus, in one embodiment, the invention provides 

5 methods of detecting expression levels of cancer-associated nucleic acids, in which 
nucleic acids (e.g., RNA from a cell culture) are hybridized to an array of nucleic 
acids that are known to be associated with malignant tumor development. For 
example, in the assay described, supra, oligonucleotides which hybridize to a 
plurality of cancer-associated nucleic acids are optionally synthesized on a DNA 

0 chip (such chips are available from Affymetrix) and the RNA from a biological 

sample, such as a cell culture, is hybridized to the chip for simultaneous analysis of 
multiple cancer-associated nucleic acids. The cancer-associated nucleic acids that 
are present in the sample which is assayed are detected at specific positions on the 
chip. 

5 Detection can be accomplished, for example, by using a labeled 

detection moiety that binds specifically to duplex nucleic acids (e.g., an antibody 
that is specific for RNA-DNA duplexes). One preferred example uses an antibody 
that recognizes DNA-RNA heteroduplexes in which the antibody is linked to an 
enzyme (typically by recombinant or covalent chemical bonding). The antibody is 

0 detected when the enzyme reacts with its substrate, producing a detectable product. 
Coutlee et al (1989) Analytical Biochemistry 181 :153-162; Bogulavski et al (1986) 
J. Immunol. Methods 89:123-130; Prooijen-Knegt (1982) Exp. Cell Res. 141:397- 
407; Rudkin (1976) Nature 265:472-473, Stollar (1970) PNAS 65:993-1000; Ballard 
(1982) Mol Immunol. 19:793-799; Pisetsky and Caster (1982) Mol Immunol. 

5 19:645-650; Viscidi et al (1988) J. Clin. Microbial 41 :199-209, and Kiney et al 
(1989) J. Clin. Microbiol 27:6-12 describe antibodies to RNA duplexes, including 
homo and heteroduplexes. Kits comprising antibodies specific for DNA:RNA 
hybrids are available, e.g., from Digene Diagnostics, Inc. (Beltsville, MD). 

In addition to available antibodies, one of skill in the art can easily 

0 make antibodies specific for nucleic acid duplexes using existing techniques, or 
modify those antibodies which are commercially or publicly available. In addition 
to the art referenced above, general methods of producing polyclonal and 
monoclonal antibodies are known to those of skill in the art. See, e.g., Paul (ed) 
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(1993) Fundamental Immunology, Third Edition Raven Press, Ltd., New York 
Coligan (1991) Current Protocols in Immunology Wiley/Greene, NY; Harlow and 
Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY; 
Stites et al. (eds.) Basic and Clinical Immunology (4th ed.) Lange Medical 

5 Publications, Los Altos, CA, and references cited therein; Goding (1986) 

Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New 
York, NY; and Kohler and Milstein (1975) Nature 256: 495-497. Other suitable 
techniques for antibody preparation include selection of libraries of recombinant 
antibodies in phage or similar vectors. See, Huse et al. (1989) Science 246: 1275- 

0 1281; and Ward et al (1989) Nature 341 : 544-546. Specific monoclonal and 

polyclonal antibodies and antisera will usually bind with a K D of at least about 0. 1 
|jM, preferably at least about 0.01 \xM or better, and most typically and preferably, 
0.001 jaM or better. 

The nucleic acids used in this invention can be either positive or 

5 negative probes. Positive probes bind to their targets and the presence of duplex 
formation is evidence of the presence of the target. Negative probes fail to bind to 
the suspect target and the absence of duplex formation is evidence of the presence of 
the target. For example, the use of a wild type specific nucleic acid probe or PCR 
primers may serve as a negative probe in an assay sample where only the nucleotide 

0 sequence of interest is present. 

The sensitivity of the hybridization assays may be enhanced through 
use of a nucleic acid amplification system which multiplies the target nucleic acid 
being detected. Examples of such systems include the polymerase chain reaction 
(PCR) system and the ligase chain reaction (LCR) system. Other methods recently 

5 described in the art are the nucleic acid sequence based amplification (NASBAd, 
Cangene, Mississauga, Ontario) and Q Beta Replicase systems. These systems can 
be used to directly identify mutants where the PCR or LCR primers are designed to 
be extended or ligated only when a selected sequence is present. Alternatively, the 
selected sequences can be generally amplified using, for example, nonspecific PCR 

3 primers and the amplified target region later probed for a specific sequence 
indicative of a mutation. 

A preferred embodiment is the use of allelic specific amplifications. 
In the case of PCR, the amplification primers are designed to bind to a portion of, 
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for example, a gene encoding a cancer-associated protein, but the terminal base at 
the y end is used to discriminate between the mutant and wild-type forms of the 
cancer-associated protein gene. If the terminal base matches the point mutation or 
the wild-type, polymerase dependent three prime extension can proceed and an 
5 amplification product is detected. This method for detecting point mutations or 
polymorphisms is described in detail by Sommer, S.S., et aL, in Mayo Clin. Proc. 
64:1361-1372 (1989), incorporated herein by reference. By using appropriate 
controls, one can develop a kit having both positive and negative amplification 
products. The products can be detected using specific probes or by simply detecting 
10 their presence or absence. A variation of the PCR method uses LCR where the point 
of discrimination, i.e, either the point mutation or the wild-type bases fall between 
the LCR oligonucleotides. The ligation of the oligonucleotides becomes the means 
for discriminating between the mutant and wild-type forms of the cancer-associated 
protein gene. 

15 An alternative means for determining the level of expression of the 

nucleic acids of the present invention is in situ hybridization. In situ hybridization 
assays are well known and are generally described in Angerer, et aL, Methods 
EnzymoL, 152:649-660 (1987). In an in situ hybridization assay cells, preferentially 
bovine lymphocytes, are fixed to a solid support, typically a glass slide. If DNA is 

20 to be probed, the cells are denatured with heat or alkali. The cells are then contacted 
with a hybridization solution at a moderate temperature to permit annealing of 
specific probes that are labeled. The probes are preferably labeled with 
radioisotopes or fluorescent reporters. 

II, Immunological Detection of a Cancer- Associated Protein 

25 In addition to the detection of the subject protein gene expression 

using nucleic acid hybridization technology, one can also use immunoassays to 
detect the protein itself. Immunoassays can be used to qualitatively or quantitatively 
analyze the proteins of interest. A general overview of the applicable technology 
can be found in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 

30 Harbor Pubs., N.Y. (1988), incorporated herein by reference. Although the 

following discussion is directed to methods for detecting target proteins associated 
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with cancer, similar methods can be used to detect methods associated with cell 
proliferation, cell transformation, neoplasia, metastasis and/or target proteins 
associated with different types of cancer (e.g., liver, kidney, breast, prostate, colon, 
etc.). 

5 A. Antibodies to Target Proteins 

Methods of producing polyclonal and monoclonal antibodies that 
react specifically with a protein of interest are known to those of skill in the art. See, 
e.g., Coligan (1991), Current Protocols in Immunology, Wiley/Greene, NY; and 
Harlow and Lane; Stites et al. (eds.) Basic and Clinical Immunology (4th ed.) Lange 

10 Medical Publications, Los Altos, CA, and references cited therein; Goding (1986), 
Monoclonal antibodies: Principles and Practice (2d ed.) Academic Press, New York, 
NY; and Kohler and Milstein (1975) Nature, 256:495-497. Such techniques include 
antibody preparation by selection of antibodies from libraries of recombinant 
antibodies in phage or similar vectors. See, Huse et al. (1989) Science, 246:1275- 

15 1281; and Ward et al. (1989) Nature, 341:544-546. For example, in order to 

produce antisera for use in an immunoassay, the proteins of interest or an antigenic 
fragment thereof, is isolated as described herein. For example, recombinant protein 
is produced in a transformed cell line. An inbred strain of mice or rabbits is 
immunized with the protein using a standard adjuvant, such as Freund's adjuvant, 

20 and a standard immunization protocol. Alternatively, a synthetic peptide derived 
from the sequences disclosed herein and conjugated to a carrier protein can be used 
as an immunogen. 

Polyclonal sera are collected and titered against the immunogen 
protein in an immunoassay, for example, a solid phase immunoassay with the 

25 immunogen immobilized on a solid support. Polyclonal antisera with a titer of 10 4 
or greater are selected and tested for their cross reactivity against non-cancer- 
associated proteins or even other homologous proteins from other organisms, using a 
competitive binding immunoassay. Specific monoclonal and polyclonal antibodies 
and antisera will usually bind with a K D of at least about 0.1 mM, more usually at 

30 least about 1 \\M, preferably at least about 0.1 pM or better, and most preferably, 
0.01 pM or better. 
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A number of proteins of the invention comprising immunogens may 
be used to produce antibodies specifically or selectively reactive with the proteins of 
interest. Recombinant protein is the preferred immunogen for the production of 
monoclonal or polyclonal antibodies. Naturally occurring protein may also be used 

5 either in pure or impure form. Synthetic peptides made using the protein sequences 
described herein may also be used as an immunogen for the production of antibodies 
to the protein. Recombinant protein can be expressed in eukaryotic or prokaryotic 
cells and purified as generally described infra. The product is then injected into an 
animal capable of producing antibodies. Either monoclonal or polyclonal antibodies 

0 may be generated, for subsequent use in immunoassays to measure the protein. 

Methods of production of polyclonal antibodies are known to those of 
skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with 
an adjuvant and animals are immunized. The animal's immune response to the 
immunogen preparation is monitored by taking test bleeds and determining the titer 

5 of reactivity to a cancer-associated protein. When appropriately high titers of 
antibody to the immunogen are obtained, blood is collected from the animal and 
antisera are prepared. Further fractionation of the antisera to enrich for antibodies 
reactive to the protein can be done if desired (see, Harlow and Lane, supra). 
Monoclonal antibodies may be obtained by various techniques 

0 familiar to those skilled in the art. Briefly, spleen cells from an animal immunized 
with a desired antigen are immortalized, commonly by fusion with a myeloma cell 
(See, Kohler and Milstein, Eur. J. Immunol 6:51 1-519 (1976), incorporated herein 
by reference). Alternative methods of immortalization include transformation with 
Epstein Barr Virus, oncogenes, or retroviruses, or other methods well known in the 

5 art. Colonies arising from single immortalized cells are screened for production of 
antibodies of the desired specificity and affinity for the antigen, and yield of the 
monoclonal antibodies produced by such cells may be enhanced by various 
techniques, including injection into the peritoneal cavity of a vertebrate host. 
Alternatively, one may isolate DNA sequences which encode a monoclonal antibody 

3 or a binding fragment thereof by screening a DNA library from human B cells 

according to the general protocol outlined by Huse, et al. (1989) Science 246: 1275- 
1281. 
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Once target protein specific antibodies are available, the protein can 
be measured by a variety of immunoassay methods with qualitative and quantitative 
results available to the clinician. For a review of immunological and immunoassay 
procedures in general see, Basic and Clinical Immunology 7th Edition (D. Stites and 

5 A. Terr ed.) 1991. Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, which are reviewed extensively in 
Enzyme Immunoassay, E.T. Maggio, ed., CRC Press, Boca Raton, Florida (1980); 
"Practice and Theory of Enzyme Immunoassays," Tijssen; and, Harlow and Lane, 
each of which is incorporated herein by reference. 

0 Immunoassays to measure target proteins in a human sample may use 

a polyclonal antiserum which was raised to the protein partially encoded by a 
sequence described herein or a fragment thereof. This antiserum is selected to have 
low crossreactivity against non-cancer-associated proteins and any such 
crossreactivity is removed by immunoabsorption prior to use in the immunoassay. 

5 In order to produce antisera for use in an immunoassay, cancer- 

associated protein or a fragment thereof, for example, is isolated as described herein. 
For example, recombinant protein is produced in a transformed cell line. An inbred 
strain of mice, such as Balb/c, is immunized with the protein or a peptide using a 
standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization 

0 protocol. Alternatively, a synthetic peptide derived from the sequences disclosed 
herein and conjugated to a carrier protein can be used an immunogen. Polyclonal 
sera are collected and titered against the immunogen protein in an immunoassay, for 
example, a solid phase immunoassay with the immunogen immobilized on a solid 
support. Polyclonal antisera with a titer of 10 4 or greater are selected and tested for 

5 their cross reactivity against non-cancer-associated proteins, using a competitive 

binding immunoassay such as the one described in Harlow and Lane, supra, at pages 
570-573 and below. 

B. Immunological Binding Assays 

In a preferred embodiment, a protein of interest is detected and/or 

0 quantified using any of a number of well recognized immunological binding assays 

{see, e.g., U.S. Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a 

review of the general immunoassays, see also Methods in Cell Biology Volume 37: 
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Antibodies in Cell Biology, Asai, ed. Academic Press, Inc. New York (1993); Basic 
and Clinical Immunology 7th Edition, Stites & Terr, eds. (1991). Immunological 
binding assays (or immunoassays) typically utilize a "capture agent" to specifically 
bind to and often immobilize the analyte (in this case the cancer-associated protein 
5 or antigenic subsequence thereof). The capture agent is a moiety that specifically 
binds to the analyte. In a preferred embodiment, the capture agent is an antibody 
that specifically binds, for example, the cancer-associated protein. The antibody 
{e.g., anti-cancer-associated protein antibody) may be produced by any of a number 
of means well known to those of skill in the art and as described above. 

10 Immunoassays also often utilize a labeling agent to specifically bind 

to and label the binding complex formed by the capture agent and the analyte. The 
labeling agent may itself be one of the moieties comprising the antibody/analyte 
complex. Thus, the labeling agent may be a labeled cancer-associated protein 
polypeptide or a labeled anti-cancer-associated protein antibody. Alternatively, the 

15 labeling agent may be a third moiety, such as another antibody, that specifically 
binds to the antibody/protein complex. 

In a preferred embodiment, the labeling agent is a second antibody 
bearing a label. Alternatively, the second antibody may lack a label, but it may, in 
turn, be bound by a labeled third antibody specific to antibodies of the species from 

20 which the second antibody is derived. The second antibody can be modified with a 
detectable moiety, such as biotin, to which a third labeled molecule can specifically 
bind, such as enzyme-labeled streptavidin. 

Other proteins capable of specifically binding immunoglobulin 
constant regions, such as protein A or protein G, can also be used as the label agent. 

25 These proteins are normal constituents of the cell walls of streptococcal bacteria. 
They exhibit a strong non-immunogenic reactivity with immunoglobulin constant 
regions from a variety of species {see, generally, Kronval, et al. (1973) J. Immunol, 
111: 1401-1406, and Akerstrom, et al (1985)/. Immunol., 135: 2589-2542). 

Throughout the assays, incubation and/or washing steps may be 

30 required after each combination of reagents. Incubation steps can vary from about 5 
seconds to several hours, preferably from about 5 minutes to about 24 hours. 
However, the incubation time will depend upon the assay format, analyte, volume of 
solution, concentrations, and the like. Usually, the assays will be carried out at 
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ambient temperature, although they can be conducted over a range of temperatures, 
such as 10°C to 40°C. 



1. Non-Competitive Assay Formats 

Immunoassays for detecting proteins of interest from tissue samples 
5 may be either competitive or noncompetitive. Noncompetitive immunoassays are 
assays in which the amount of captured analyte (in this case the protein) is directly 
measured. In one preferred "sandwich" assay, for example, the capture agent (e.g., 
anti-cancer-associated protein antibodies) can be bound directly to a solid substrate 
where it is immobilized. These immobilized antibodies then capture the cancer- 

10 associated protein present in the test sample. The cancer-associated protein thus 
immobilized is then bound by a labeling agent, such as a second cancer-associated 
protein antibody bearing a label. Alternatively, the second antibody may lack a 
label, but it may, in turn, be bound by a labeled third antibody specific to antibodies 
of the species from which the second antibody is derived. The second can be 

15 modified with a detectable moiety, such as biotin, to which a third labeled molecule 
can specifically bind, such as enzyme-labeled streptavidin. 

2. Competitive Assay Formats 

In competitive assays, the amount of target protein (analyte) present 
in the sample is measured indirectly by measuring the amount of an added 

20 (exogenous) analyte (i.e., the cancer- associated protein of interest) displaced (or 
competed away) from a capture agent (anti-cancer-associated protein antibody) by 
the analyte present in the sample. In one competitive assay, a known amount of, in 
this case, the protein of interest is added to the sample and the sample is then 
contacted with a capture agent, in this case an antibody that specifically binds to the 

25 cancer-associated protein. The amount of cancer-associated protein bound to the 
antibody is inversely proportional to the concentration of cancer-associated protein 
present in the sample. In a particularly preferred embodiment, the antibody is 
immobilized on a solid substrate. The amount of the cancer-associated protein 
bound to the antibody may be determined either by measuring the amount of subject 

30 protein present in a cancer- associated protein/antibody complex or, alternatively, by 
measuring the amount of remaining uncomplexed protein. The amount of cancer- 
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associated protein may be detected by providing a labeled cancer-associated protein 
molecule. 

A hapten inhibition assay is another preferred competitive assay. In 
this assay, a known analyte, in this case the target protein, is immobilized on a solid 
5 substrate. A known amount of anti-cancer-associated protein antibody is added to 
the sample, and the sample is then contacted with the immobilized target. In this 
case, the amount of anti-cancer-associated protein antibody bound to the 
immobilized cancer-associated protein is inversely proportional to the amount of 
cancer-associated protein present in the sample. Again, the amount of immobilized 

10 antibody may be detected by detecting either the immobilized fraction of antibody or 
the fraction of the antibody that remains in solution. Detection may be direct where 
the antibody is labeled or indirect by the subsequent addition of a labeled moiety 
that specifically binds to the antibody as described above. 

Immunoassays in the competitive binding format can be used for 

1 5 crossreactivity determinations. For example, the protein encoded by the sequences 
described herein can be immobilized to a solid support. Proteins are added to the 
assay which compete with the binding of the antisera to the immobilized antigen. 
The ability of the above proteins to compete with the binding of the antisera to the 
immobilized protein is compared to the protein encoded by any of the sequences 

20 described herein. The percent crossreactivity for the above proteins is calculated, 
using standard calculations. Those antisera with less than 10% crossreactivity with 
each of the proteins listed above are selected and pooled. The cross-reacting 
antibodies are optionally removed from the pooled antisera by immunoabsorption 
with the considered proteins, e.g., distantly related homologues. 

25 The immunoabsorbed and pooled antisera are then used in a 

competitive binding immunoassay as described above to compare a second protein, 
thought to be perhaps the protein of this invention, to the immunogen protein. In 
order to make this comparison, the two proteins are each assayed at a wide range of 
concentrations and the amount of each protein required to inhibit 50% of the binding 

30 of the antisera to the immobilized protein is determined. If the amount of the second 
protein required is less than 10 times the amount of the protein partially encoded by 
a sequence herein that is required, then the second protein is said to specifically bind 
to an antibody generated to an immunogen consisting of the target protein. 
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3. Other Assay Formats 

In a particularly preferred embodiment, Western blot (immunoblot) 
analysis is used to detect and quantify the presence of cancer- associated protein in 
the sample. The technique generally comprises separating sample proteins by gel 

5 electrophoresis on the basis of molecular weight, transferring the separated proteins 
to a suitable solid support (such as a nitrocellulose filter, a nylon filter, or 
derivatized nylon filter) and incubating the sample with the antibodies that 
specifically bind the protein of interest. For example, the anti-cancer-associated 
protein antibodies specifically bind to the cancer-associated protein on the solid 

0 support. These antibodies may be directly labeled or alternatively may be 

subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse 
antibodies) that specifically bind to the antibodies against the protein of interest. 

Other assay formats include liposome immunoassays (LIA), which 
use liposomes designed to bind specific molecules (e.g., antibodies) and release 

5 encapsulated reagents or markers. The released chemicals are then detected 

according to standard techniques (see, Monroe et al. (1986) Amer. Clin. Prod. Rev. 
5:34-41). 

4. Reduction of Non-Specific Binding 

One of skill in the art will appreciate that it is often desirable to use 
0 non-specific binding in immunoassays. Particularly, where the assay involves an 
antigen or antibody immobilized on a solid substrate it is desirable to minimize the 
amount of non-specific binding to the substrate. Means of using such non-specific 
binding are well known to those of skill in the art. Typically, this involves coating 
the substrate with a proteinaceous composition. In particular, protein compositions, 
5 such as bovine serum albumin (BSA), nonfat powdered milk and gelatin, are widely 
used with powdered milk being most preferred. 

5. Labels 

The particular label or detectable group used in the assay is not a 
critical aspect of the invention, so long as it does not significantly interfere with the 
0 specific binding of the antibody used in the assay. The detectable group can be any 
material having a detectable physical or chemical property. Such detectable labels 
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have been well-developed in the field of immunoassays and, in general, most labels 
useful in such methods can be applied to the present invention. Thus, a label is any 
composition detectable by spectroscopic, photochemical, biochemical, 
immunochemical, electrical, optical or chemical means. Useful labels in the present 

5 invention include magnetic beads (e.g., Dynabeads™), fluorescent dyes (e.g., 

fluorescein isothiocyanate, Texas red, rhodamine, and the like), radiolabels (e.g., 3 H, 
125 1, 35 S, 14 C, or 32 P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase 
and others commonly used in an ELISA), and colorimetric labels such as colloidal 
gold or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. 

0 The label may be coupled directly or indirectly to the desired 

component of the assay according to methods well known in the art. As indicated 
above, a wide variety of labels may be used, with the choice of label depending on 
sensitivity required, ease of conjugation with the compound, stability requirements, 
available instrumentation, and disposal provisions. 

5 Non-radioactive labels are often attached by indirect means. 

Generally, a ligand molecule (e.g., biotin) is covalently bound to the molecule. The 
ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either 
inherently detectable or covalently bound to a signal system, such as a detectable 
enzyme, a fluorescent compound, or a chemiluminescent compound. A number of 

0 ligands and anti-ligands can be used. Thyroxine, and Cortisol can be used in 

conjunction with the labeled, naturally occurring anti-ligands. Alternatively, any 
haptenic or antigenic compound can be used in combination with an antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of 

5 interest as labels will primarily be hydrolases, particularly phosphatases, esterases 
and glycosidases, or oxidotases, particularly peroxidases. Fluorescent compounds 
include fluorescein and its derivatives, rhodamine and its derivatives, dansyl, 
umbelliferone, etc. Chemiluminescent compounds include luciferin, and 2,3- 
dihydrophthalazinediones, e.g., luminol. For a review of various labeling or signal 

3 producing systems which may be used, see, U.S. Patent No. 4,391,904). 

Means of detecting labels are well known to those of skill in the art. 
Thus, for example, where the label is a radioactive label, means for detection include 
a scintillation counter or photographic film as in autoradiography. Where the label 
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is a fluorescent label, it may be detected by exciting the fluorochrome with the 
appropriate wavelength of light and detecting the resulting fluorescence. The 
fluorescence may be detected visually, by means of photographic film, by the use of 
electronic detectors such as charge coupled devices (CCDs) or photomultipliers and 

5 the like. Similarly, enzymatic labels may be detected by providing the appropriate 
substrates for the enzyme and detecting the resulting reaction product. Finally 
simple colorimetric labels may be detected directly by observing the color associated 
with the label. Thus, in various dipstick assays, conjugated gold often appears pink, 
while various conjugated beads appear the color of the bead. 

0 Some assay formats do not require the use of labeled components. 

For instance, agglutination assays can be used to detect the presence of the target 
antibodies. In this case, antigen-coated particles are agglutinated by samples 
comprising the target antibodies. In this format, none of the components need be 
labeled and the presence of the target antibody is detected by simple visual 

5 inspection. 



III. Screening for Modulators of the Cancer Development Process 

The invention also provides methods for identifying compounds that 
modulate cancer development, e.g., loss of proliferation control, cell transformation, 
development of malignancy and/or metastasis. For example, the methods can 
identify compounds that increase or decrease the expression level of genes 
associated with cancer (e.g., cell proliferation, neoplasia, cell transformation, 
malignancy, metastatic ability, etc.) and cancer-related conditions. Although the 
following discussion is directed to methods for screening for modulators of cancer 
development, similar methods can be used to screen for modulators of cell 
proliferation, neoplasia, cell transformation or for modulators of different types of 
cancer (e.g., liver, kidney, breast, prostate, colon, etc.). 

For instance, compounds that are identified as modulators of cancer 
development using the methods of the invention find use both in vitro and in vivo. 
For example, one can treat cell cultures with the modulators in experiments designed 
to determine the mechanisms by which cancer development is regulated. 
Compounds that increase tumor development and/or induce neoplasia are useful for 
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extending the life of cell cultures that are used for production of biological products 
such as recombinant proteins. In vivo uses of compounds that delay cancer 
development include, for example, delaying the loss of cell proliferation control 
and/or cell transformation and treating conditions associated with neoplasia, 

5 malignant tumor formation and/or metastasis. 

The methods typically involve culturing a cell in the presence of a 
potential modulator to form a first cell culture. RNA (or cDNA) from the first cell 
culture is contacted with a probe which comprises a polynucleotide sequence 
associated with cancer. The amount of the probe which hybridizes to the RNA (or 

0 cDNA) from the first cell culture is determined. Typically, one determines whether 
the amount of probe which hybridizes to the RNA (or cDNA) is increased or 
decreased relative to the amount of the probe which hybridizes to RNA (or cDNA) 
from a second cell culture grown in the absence of the modulator. 

It may be further determined whether the modulator-induced increase 

5 or decrease in RNA (or cDNA) levels of the target sequence is correlated with any 
cancer-associated change in cellular phenotype. For example, a liver, kidney, breast, 
colon or prostate cell population that is treated with a modulator which induces 
decreased expression of a gene that is normally upregulated with cancer, or a liver, 
kidney, breast, colon or prostate cell that is treated with a modulator which induces 

3 increased expression of a gene that is normally downregulated with cancer, may be 
further tested for loss of control of cell proliferation, which is reflective, for 
example, of a neoplastic phenotype, for cell transformation or for metastatic ability. 

Essentially any chemical compound can be used as a potential 
modulator in the assays of the invention, although most often compounds can be 

5 dissolved in aqueous or organic (for example, DMSO-based) solutions are used. 
The assays are designed to screen large chemical libraries by automating the assay 
steps and providing compounds from any convenient source to assays, which are 
typically run in parallel (e.g., in microtiter formats on microtiter plates in robotic 
assays). It will be appreciated that there are many suppliers of chemical compounds, 

3 including Sigma (St. Louis, MO), Aldrich (St. Louis, MO), Sigma- Aldrich (St. 
Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs Switzerland) and the 
like. 
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In one preferred embodiment, high throughput screening methods 
involve providing a combinatorial library containing a large number of potential 
therapeutic compounds (potential modulator compounds). Such "combinatorial 
chemical libraries" are then screened in one or more assays, as described herein, to 

5 identify those library members (particular chemical species or subclasses) that 

display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual 
therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 

0 compounds generated by either chemical synthesis or biological synthesis, by 

combining a number of chemical "building blocks" such as reagents. For example, a 
linear combinatorial chemical library such as a polypeptide library is formed by 
combining a set of chemical building blocks (amino acids) in every possible way for 
a given compound length (i.e., the number of amino acids in a polypeptide 

5 compound). Millions of chemical compounds can be synthesized through such 
combinatorial mixing of chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well 
known to those of skill in the art. Such combinatorial chemical libraries include, but 
are not limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. 

3 Pept. Prot. Res. 37:487-493 (1991) and Houghton et aL, Nature 354:84-88 (1991)). 
Other chemistries for generating chemical diversity libraries can also be used. Such 
chemistries include, but are not limited to: peptoids (PCT Publication No. WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio- 
oligomers (PCT Publication No. WO 92/00091), benzodiazepines (U.S. Pat. No. 

5 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
etaL, Proc. Nat. Acad. Sci. USA 90:6909-6913 (1993)), vinylogous polypeptides 
(Hagihara et aL, J. Amer. Chem. Soc. 1 14:6568 (1992)), nonpeptidal 
peptidomimetics with P-D-glucose scaffolding (Hirschmann et aL, J. Amer. Chem. 
Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound 

) libraries (Chen et al.,J. Amer. Chem. Soc. 1 16:2661 (1994)), oligocarbamates (Cho 
et aL, Science 261 :1303 (1993)), and/or peptidyl phosphonates (Campbell et aL, J. 
Org. Chem. 59:658 (1994)), nucleic acid libraries (see, Ausubel, Berger and 
Sambrook, all supra), peptide nucleic acid libraries (see, e.g., U.S. Patent 
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5,539,083), antibody libraries {see, e.g., Vaughn et al, Nature Biotechnology \ 
14(3):309-314 (1996) and PCT/US96/1 0287), carbohydrate libraries (see, e.g., Liang 
et aL, Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic 
molecule libraries (see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993); 

5 isoprenoids, U.S. Patent 5,569,588; thiazolidinones and metathiazanones, U.S. 
Patent 5,549,974; pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino 
compounds, U.S. Patent 5,506,337; benzodiazepines, 5,288,514, and the like. 

Devices for the preparation of combinatorial libraries are 
commercially available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, 

0 Louisville KY, Symphony, Rainin, Woburn, MA, 43 3 A Applied Biosystems, Foster 
City, CA, 9050 Plus, Millipore, Bedford, MA). In addition, numerous combinatorial 
libraries are themselves commercially available (see, e.g., ComGenex, Princeton, 
N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, MO, ChemStar, Ltd, Moscow, 
RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 

5 As noted, the invention provides in vitro assays for identifying, in a 

high throughput format, compounds that can modulate cancer development. Control 
reactions that measure the transformation or malignancy level of the cell in a 
reaction that does not include a potential modulator are optional, as the assays are 
highly uniform. Such optional control reactions are appropriate and increase the 

0 reliability of the assay. Accordingly, in a preferred embodiment, the methods of the 
invention include such a control reaction. For each of the assay formats described, 
"no modulator" control reactions which do not include a modulator provide a 
background level of binding activity. 

In some assays it will be desirable to have positive controls to ensure 

5 that the components of the assays are working properly. At least two types of 
positive controls are appropriate. First, a known activator of cancer development 
can be incubated with one sample of the assay, and the resulting increase in signal 
resulting from an increased expression level of a gene associated with cancer 
determined according to the methods herein. Second, a known inhibitor of cancer 

0 development can be added, and the resulting decrease in signal for the expression of 
a gene associated with cancer similarly detected. It will be appreciated that 
modulators can also be combined with activators or inhibitors to find modulators 
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which inhibit the increase or decrease that is otherwise caused by the presence of the 
known modulator of cancer development. 

In the high throughput assays of the invention, it is possible to screen 
up to several thousand different modulators in a single day. In particular, each well 

5 of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or, if concentration or incubation time effects are to be observed, every 
5-10 wells can test a single modulator. Thus, a single standard microtiter plate can 
assay about 100 (96) modulators. If 1536 well plates are used, then a single plate 
can easily assay from about 100- about 1500 different compounds. It is possible to 

0 assay many different plates per day; assay screens for up to about 6,000-20,000, and 
even up to about 100,000 different compounds is possible using the integrated 
systems of the invention. 



IV. Compositions, Kits and Integrated Systems 

The invention provides compositions, kits and integrated systems for 

5 practicing the assays described herein. Although the following discussion is 

directed to kits for carrying out assays using nucleic acids (or proteins, antibodies, 
etc.) associated with cancer, similar kits can be assembled for carrying out assays 
using nucleic acids (or proteins, antibodies, etc.) associated with cell proliferation, 
neoplasia, cell transformation, malignancy, metastasis and/or nucleic acids 

0 associated with different types of cancer {e.g., liver, kidney, breast, prostate, colon, 
etc.). For instance, an assay composition having a nucleic acid associated with, for 
example, malignancy of a cell and a labeling reagent is provided by the present 
invention. In preferred embodiments, a plurality of, for example, cancer-associated 
nucleic acids are provided in the assay compositions. The invention also provides 

5 assay compositions for use in solid phase assays; such compositions can include, for 
example, one or more cancer-associated nucleic acids immobilized on a solid 
support, and a labeling reagent. In each case, the assay compositions can also 
include additional reagents that are desirable for hybridization. Modulators of 
expression of, for example, cancer-associated nucleic acids can also be included in 

0 the assay compositions. 
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The invention also provides kits for carrying out the assays of the 
invention. The kits typically include a probe which comprises a polynucleotide 
sequence associated with cancer; and a label for detecting the presence of the probe. 
Preferably, the kits will include a plurality of polynucleotide sequences associated 
5 with cancer. Kits can include any of the compositions noted above, and optionally 
further include additional components such as instructions to practice a high- 
throughput method of assaying for an effect on cell proliferation and transformation 
and expression of cancer-associated genes, one or more containers or compartments 
(e.g., to hold the probe, labels, or the like), a control modulator of cancer 
10 development, a robotic armature for mixing kit components or the like. 

The invention also provides integrated systems for high-throughput 
screening of potential modulators for an effect on cancer development. The systems 
typically include a robotic armature which transfers fluid from a source to a 
destination, a controller which controls the robotic armature, a label detector, a data 
1 5 storage unit which records label detection, and an assay component such as a 
microtiter dish comprising a well having a reaction mixture or a substrate 
comprising a fixed nucleic acid or immobilization moiety. 

A number of robotic fluid transfer systems are available, or can easily 
be made from existing components. For example, a Zymate XP (Zymark 
20 Corporation; Hopkinton, MA) automated robot using a Microlab 2200 (Hamilton; 
Reno, NV) pipetting station can be used to transfer parallel samples to 96 well 
microtiter plates to set up several parallel simultaneous STAT binding assays. 

Optical images viewed (and, optionally, recorded) by a camera or 
other recording device (e.g., a photodiode and data storage device) are optionally 
25 further processed in any of the embodiments herein, e.g., by digitizing the image and 
storing and analyzing the image on a computer. A variety of commercially available 
peripheral equipment and software is available for digitizing, storing and analyzing a 
digitized video or digitized optical image, e.g., using PC (Intel x86 or Pentium chip- 
compatible DOS®, OS2® WINDOWS®, WINDOWS NT® or WINDOWS95® based 
30 computers), MACINTOSH®, or UNIX® based (e.g., SUN® work station) computers. 

One conventional system carries light from the specimen field to a 
cooled charge-coupled device (CCD) camera, in common use in the art. A CCD 
camera includes an array of picture elements (pixels). The light from the specimen 
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is imaged on the CCD. Particular pixels corresponding to regions of the specimen 
(e.g., individual hybridization sites on an array of biological polymers) are sampled 
to obtain light intensity readings for each position. Multiple pixels are processed in 
parallel to increase speed. The apparatus and methods of the invention are easily 

5 used for viewing any sample, e.g., by fluorescent or dark field microscopic 
techniques. 



V. Gene Therapy Applications 

A variety of human diseases can be treated by therapeutic approaches 
that involve stably introducing a gene into a human cell such that the gene is 

10 transcribed and the gene product is produced in the cell. Diseases amenable to 

treatment by this approach include inherited diseases, including those in which the 
defect is in a single gene. Gene therapy is also useful for treatment of acquired 
diseases and other conditions. For discussions on the application of gene therapy 
towards the treatment of genetic as well as acquired diseases, see, Miller, A.D. 

15 (1992) Nature 357:455-460, and Mulligan, R.C. (1993) Science 260:926-932, both 
of which are incorporated herein by reference. 

A. Vectors for Gene Delivery 

For delivery to a cell or organism, the nucleic acids of the invention 
can be incorporated into a vector. Examples of vectors used for such purposes 

20 include expression plasmids capable of directing the expression of the nucleic acids 
in the target cell. In other instances, the vector is a viral vector system wherein the 
nucleic acids are incorporated into a viral genome that is capable of transfecting the 
target cell. In a preferred embodiment, the nucleic acids can be operably linked to 
expression and control sequences that can direct expression of the gene in the 

25 desired target host cells. Thus, one can achieve expression of the nucleic acid under 
appropriate conditions in the target cell. 

B. Gene Delivery Systems 



Viral vector systems useful in the expression of the nucleic acids 
include, for example, naturally occurring or recombinant viral vector systems. 
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Depending upon the particular application, suitable viral vectors include replication 
competent, replication deficient, and conditionally replicating viral vectors. For 
example, viral vectors can be derived from the genome of human or bovine 
adenoviruses, vaccinia virus, herpes virus, adeno-associated virus, minute virus of 

5 mice (MVM), HIV, sindbis virus, and retroviruses (including but not limited to Rous 
sarcoma virus), and MoMLV. Typically, genes of interest are inserted into such 
vectors to allow packaging of the gene construct, typically with accompanying viral 
DNA, followed by infection of a sensitive host cell and expression of the gene of 
interest. 

10 As used herein, "gene delivery system" refers to any means for the 

delivery of a nucleic acid of the invention to a target cell. In some embodiments of 
the invention, nucleic acids are conjugated to a cell receptor ligand for facilitated 
uptake (e.g., invagination of coated pits and internalization of the endosome) 
through an appropriate linking moiety, such as a DNA linking moiety (Wu et ah, J. 

15 Biol. Chem. 263:14621-14624 (1988); WO 92/06180). For example, nucleic acids 
can be linked through a polylysine moiety to asialo-oromucocid, which is a ligand 
for the asialoglycoprotein receptor of hepatocytes. 

Similarly, viral envelopes used for packaging gene constructs that 
include the nucleic acids of the invention can be modified by the addition of receptor 

20 ligands or antibodies specific for a receptor to permit receptor-mediated endocytosis 
into specific cells (see, e.g., WO 93/20221, WO 93/14188, WO 94/06923). In some 
embodiments of the invention, the DNA constructs of the invention are linked to 
viral proteins, such as adenovirus particles, to facilitate endocytosis (Curiel et al. 9 
Proc. Natl. Acad. Sci. U.S.A. 88: 8850-8854 (1991)). In other embodiments, 

25 molecular conjugates of the instant invention can include microtubule inhibitors 

(WO/9406922); synthetic peptides mimicking influenza virus hemagglutinin (Plank 
et al, J. Biol. Chem. 269:12918-12924 (1994)); and nuclear localization signals such 
as SV40 T antigen (W093/19768). 

Retroviral vectors are also useful for introducing the nucleic acids of 

30 the invention into target cells or organisms. Retroviral vectors are produced by 
genetically manipulating retroviruses. The viral genome of retroviruses is RNA. 
Upon infection, this genomic RNA is reverse transcribed into a DNA copy which is 
integrated into the chromosomal DNA of transduced cells with a high degree of 
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stability and efficiency. The integrated DNA copy is referred to as a provirus and is 
inherited by daughter cells as is any other gene. The wild type retroviral genome 
and the proviral DNA have three genes: the gag, the pol and the env genes, which 
are flanked by two long terminal repeat (LTR) sequences. The gag gene encodes the 
5 internal structural (nucleocapsid) proteins; the pol gene encodes the RNA directed 
DNA polymerase (reverse transcriptase); and the env gene encodes viral envelope 
glycoproteins. The 5 f and 3' LTRs serve to promote transcription and 
polyadenylation of virion RNAs. Adjacent to the 5 f LTR are sequences necessary 
for reverse transcription of the genome (the tRNA primer binding site) and for 

10 efficient encapsulation of viral RNA into particles (the Psi site). See, Mulligan, 

R.C., In: Experimental Manipulation of Gene Expression, M. Inouye (ed), 155-173 
(1983); Mann, R., etal, Cell, 33:153-159 (1983); Cone, R.D. and R.C. Mulligan, 
Proceedings of the National Academy of Sciences, U.S.A., 81 :6349-6353 (1984). 

The design of retroviral vectors is well known to those of ordinary 

15 skill in the art. See, e.g., Singer, M. and Berg, P., supra. In brief, if the sequences 
necessary for encapsidation (or packaging of retroviral RNA into infectious virions) 
are missing from the viral genome, the result is a cis acting defect which prevents 
encapsidation of genomic RNA. However, the resulting mutant is still capable of 
directing the synthesis of all virion proteins. Retroviral genomes from which these 

20 sequences have been deleted, as well as cell lines containing the mutant genome 
stably integrated into the chromosome are well known in the art and are used to 
construct retroviral vectors. Preparation of retroviral vectors and their uses are 
described in many publications including European Patent Application EPA 0 178 
220, U.S. Patent 4,405,712, Gilboa, Biotechniques 4:504-512 (1986), Mann, et al, 

25 Cell 33:153-159 (1983), Cone and Mulligan, Proc. Natl Acad. Sci. USA 

81 :6349-6353 (1984), Eglitis, M.A, et al. (1988) Biotechniques 6:608-614, Miller, 
A.D. et al. (1989) Biotechniques 7:981-990, Miller, A.D.(1992) Nature, supra, 
Mulligan, R.C. (1993), supra, and Gould, B. et al, and International Publication No. 
WO 92/07943 entitled "Retroviral Vectors Useful in Gene Therapy". The teachings 

30 of these patents and publications are incorporated herein by reference. 

The retroviral vector particles are prepared by recombinantly 
inserting the desired nucleotide sequence into a retrovirus vector and packaging the 
vector with retroviral capsid proteins by use of a packaging cell line. The resultant 
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retroviral vector particle is incapable of replication in the host cell but is capable of 
integrating into the host cell genome as a proviral sequence containing the desired 
nucleotide sequence. As a result, the patient is capable of producing, for example, 
the cancer-associated protein and thus restore the cells to a normal, non-cancerous 

5 phenotype. 

Packaging cell lines that are used to prepare the retroviral vector 
particles are typically recombinant mammalian tissue culture cell lines that produce 
the necessary viral structural proteins required for packaging, but which are 
incapable of producing infectious virions. The defective retroviral vectors that are 
10 used, on the other hand, lack these structural genes but encode the remaining 
proteins necessary for packaging. To prepare a packaging cell line, one can 
construct an infectious clone of a desired retrovirus in which the packaging site has 
been deleted. Cells comprising this construct will express all structural viral 
proteins, but the introduced DNA will be incapable of being packaged. 
15 Alternatively, packaging cell lines can be produced by transforming a cell line with 
one or more expression plasmids encoding the appropriate core and envelope 
proteins. In these cells, the gag 9 pol, and env genes can be derived from the same or 
different retroviruses. 

A number of packaging cell lines suitable for the present invention 
20 are also available in the prior art. Examples of these cell lines include Crip, GPE86, 
PA317 and PG13. See Miller et aL, J. Virol. 65:2220-2224 (1991), which is 
incorporated herein by reference. Examples of other packaging cell lines are 
described in Cone, R. and Mulligan, R.C., Proceedings of the National Academy of 
Sciences, USA, 81:6349-6353 (1984) and in Danos, O. and R.C. Mulligan, 
25 Proceedings of the National Academy of Sciences, USA, 85: 6460-6464 (1988), 
Eglitis, M.A., et aL (1988), supra, and Miller, A.D., (1990), supra, also all 
incorporated herein by reference. 

Packaging cell lines capable of producing retroviral vector particles 
with chimeric envelope proteins may be used. Alternatively, amphotropic or 
30 xenotropic envelope proteins, such as those produced by PA3 17 and GPX packaging 
cell lines may be used to package the retroviral vectors. 

In some embodiments of the invention, an antisense nucleic acid is 
administered which hybridizes to an gene associated with cancer development or to 
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a transcript thereof. The antisense nucleic acid can be provided as an antisense 
oligonucleotide {see, e.g., Murayama et aL, Antisense Nucleic Acid Drug Dev. 
7:109-1 14 (1997)). Genes encoding an antisense nucleic acid can also be provided; 
such genes can be introduced into cells by methods known to those of skill in the art. 
5 For example, one can introduce a gene that encodes an antisense nucleic acid in a 
viral vector, such as, for example, in hepatitis B virus {see, e.g., Ji et aL, J. Viral 
Hepat. 4:167-173 (1997)), in adeno-associated virus {see, e.g., Xiao et aL, Brain 
Res. 756:76-83 (1997)), or in other systems including, but not limited, to an HVJ 
(Sendai virus)-liposome gene delivery system {see, e.g., Kaneda et aL, Ann. NY. 

10 Acad. Sci. 811 :299-308 (1997)), a "peptide vector" {see, e.g., Vidal et al, CR Acad. 
Sci 7/732:279-287 (1997)), as a gene in an episomal or plasmid vector {see, e.g., 
Cooper et al, Proc. Natl. Acad. Sci. U.S.A. 94:6450-6455 (1997), Yew et al. Hum 
Gene Ther. 8:575-584 (1997)), as a gene in a peptide-DNA aggregate {see, e.g., 
Niidome et al, J. Biol. Chem. 272:15307-15312 (1997)), as "naked DNA" {see t e.g., 

15 U.S. 5,580,859 and U.S. 5,589,466), in lipidic vector systems {see, e.g., Lee et al, 
Crit Rev Ther Drug Carrier Syst. 14:173-206 (1997)), polymer coated liposomes 
(Marin et al, United States Patent No. 5,213,804, issued May 25, 1993; Woodle et 
aL, United States Patent No. 5,013,556, issued May 7, 1991), cationic liposomes 
(Epand et aL, United States Patent No. 5,283,185, issued February 1, 1994; Jessee, 

20 J. A., United States Patent No. 5,578,475, issued November 26, 1996; Rose et al, 
United States Patent No. 5,279,833, issued January 18, 1994; Gebeyehu et al, 
United States Patent No. 5,334,761, issued August 2, 1994), gas filled microspheres 
(Unger et al., United States Patent No. 5,542,935, issued August 6, 1996), ligand- 
targeted encapsulated macromolecules (Low et al. United States Patent No. 

25 5,108,921, issued April 28, 1992; Curiel et al., United States Patent No. 5,521,291, 
issued May 28, 1996; Groman et aL, United States Patent No. 5,554,386, issued 
September 10, 1996; Wu et aL, United States Patent No. 5,166,320, issued 
November 24, 1992). 

C. Pharmaceutical Formulations 

30 When used for pharmaceutical purposes, the vectors used for gene 

therapy are formulated in a suitable buffer, which can be any pharmaceutically 

acceptable buffer, such as phosphate buffered saline or sodium phosphate/sodium 
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sulfate, Tris buffer, glycine buffer, sterile water, and other buffers known to the 
ordinarily skilled artisan such as those described by Good et al. (1966) Biochemistry 
5:467. 

The compositions can additionally include a stabilizer, enhancer or 
5 other pharmaceutical^ acceptable carriers or vehicles. A pharmaceutical^ 

acceptable carrier can contain a physiologically acceptable compound that acts, for 
example, to stabilize the nucleic acids of the invention and any associated vector. A 
physiologically acceptable compound can include, for example, carbohydrates, such 
as glucose, sucrose or dextrans, antioxidants, such as ascorbic acid or glutathione, 

10 chelating agents, low molecular weight proteins or other stabilizers or excipients. 
Other physiologically acceptable compounds include wetting agents, emulsifying 
agents, dispersing agents or preservatives, which are particularly useful for 
preventing the growth or action of microorganisms. Various preservatives are well 
known and include, for example, phenol and ascorbic acid. Examples of carriers, 

15 stabilizers or adjuvants can be found in Martin, Remington's Pharm. ScL, 15th Ed. 
(Mack Publ. Co., Easton, PA 1975), which is incorporated herein by reference. 

D. Administration of Formulations 

The formulations of the invention can be delivered to any tissue or 
organ using any delivery method known to the ordinarily skilled artisan for example. 

20 In some embodiments of the invention, the nucleic acids of the invention are 
formulated in mucosal, topical, and/or buccal formulations, particularly 
mucoadhesive gel and topical gel formulations. Exemplary permeation enhancing 
compositions, polymer matrices, and mucoadhesive gel preparations for transdermal 
delivery are disclosed in U.S. Patent No. 5,346,701. In some embodiments of the 

25 invention, a therapeutic agent is formulated in ophthalmic formulations for 
administration to the eye. 

E. Methods of Treatment 

The gene therapy formulations of the invention are typically 
administered to a cell. The cell can be provided as part of a tissue, such as an 
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epithelial membrane, or as an isolated cell, such as in tissue culture. The cell can be 
provided in vivo, ex vivo, or in vitro. 

The formulations can be introduced into the tissue of interest in vivo 
or ex vivo by a variety of methods. In some embodiments of the invention, the 
5 nucleic acids of the invention are introduced to cells by such methods as 

microinjection, calcium phosphate precipitation, liposome fusion, or biolistics. In 
further embodiments, the nucleic acids are taken up directly by the tissue of interest. 

In some embodiments of the invention, the nucleic acids of the 
invention are administered ex vivo to cells or tissues explanted from a patient, then 
10 returned to the patient. Examples of ex vivo administration of therapeutic gene 

constructs include Arteaga et aL, Cancer Research 56(5): 1098-1 103 (1996); Nolta et 
al, Proc Natl Acad. Sci. USA 93(6):2414-9 (1996); Koc et al, Seminars in 
Oncology 23 (l):46-65 (1996); Raper et al, Annals of Surgery 223(2): 1 16-26 
(1996); Dalesandro etal,J. Thorac. Cardi. Surg., ll(2):416-22 (1996); and 
15 Makarov et ah, Proc. Natl. Acad. Set USA 93(l):402-6 (1996). 

VI. General Recombinant Nucleic Acids Methods for Use with the 
Invention 

A. General Recombinant Nucleic Acids Methods 

Nucleotide sizes are given in either kilobases (kb) or base pairs (bp). 
20 These are estimates derived from agarose or acrylamide gel electrophoresis or, 

alternatively, from published DNA sequences. 

Oligonucleotides that are not commercially available can be 

chemically synthesized according to the solid phase phosphoramidite triester method 

first described by S.L. Beaucage and M.H. Caruthers, Tetrahedron Letts., 
25 22(20): 1859- 1862 (1981), using an automated synthesizer, as described in D.R. 

Needham Van Devanter et. al, Nucleic Acids Res., 12:6159-6168, 1984. 

Purification of oligonucleotides is, for example, by either native acrylamide gel 

electrophoresis or by anion-exchange HPLC as described in J.D. Pearson and F.E. 

Reanier,./ Chrom., 255:137-149, 1983. 
30 The nucleic acids described here, or fragments thereof, can be used as 

a hybridization probe for a cDNA library to isolate the corresponding full length 
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cDNA and to isolate other cDNAs which have a high sequence similarity to the gene 
or similar biological activity. Probes of this type preferably have at least 30 bases 
and may contain, for example, 50 or more bases. The probe may also be used to 
identify a cDNA clone corresponding to a fiill length transcript and a genomic clone 
5 or clones that contain the complete gene, including regulatory and promotor regions, 
exons and introns. An example of such a screen includes isolating the coding region 
of the gene by using the known DNA sequence to synthesize an oligonucleotide 
probe. Labeled oligonucleotides having a sequence complementary to that of the 
nucleic acids of the present invention can be used to screen a library of human 

10 cDNA, genomic DNA or mRNA to determine which members of the library the 
probe hybridizes to. 

The sequence of the cloned genes and synthetic oligonucleotides can 
be verified using the chemical degradation method of A.M. Maxam et al. 9 Methods 
in Enzymology, 65:499560, (1980). The sequence can be confirmed after the 

15 assembly of the oligonucleotide fragments into the double-stranded DNA sequence 
using the method of Maxam and Gilbert, supra, or the chain termination method for 
sequencing double-stranded templates of R.B. Wallace et al t Gene, 16:21-26, 1981. 
Southern blot hybridization techniques can be carried out according to Southern et 
al. 9 J. Mol Biol, 98:503, 1975. 

20 B. Cloning Methods for the Isolation of Nucleotide Sequences 

Encoding the Desired Proteins 

In general, the nucleic acids encoding the subject proteins are cloned 
from DNA sequence libraries that are made to encode copy DNA (cDNA) or 
genomic DNA. The particular sequences can be located by hybridizing with an 

25 oligonucleotide probe, the sequence of which can be derived from the sequence 
listing provided herein, which provides a reference for PCR primers and defines 
suitable regions for isolating cancer-associated specific probes. Alternatively, where 
the sequence is cloned into an expression library, the expressed recombinant protein 
can be detected immunologically with antisera or purified antibodies made against 

30 the cancer- associated protein of interest. 

To make the cDNA library, one should choose a source that is rich in 
mRNA. The mRNA can then be made into cDNA, ligated into a recombinant 
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vector, and transfected into a recombinant host for propagation, screening and 
cloning. Methods for making and screening cDNA libraries are well known. See, 
Gubler, U. and Hoffinan, B.J., Gene 25:263-269, 1983 and Sambrook, supra. 

For a genomic library, the DNA is extracted from the tissue and 

5 either mechanically sheared or enzymatically digested to yield fragments of 
preferably about 5-100 kb. The fragments are then separated by gradient 
centrifugation from undesired sizes and are constructed in bacteriophage lambda 
vectors. These vectors and phage are packaged in vitro, as described in Sambrook, 
supra. Recombinant phages are analyzed by plaque hybridization as described in 

0 Benton and Davis, Science, 196:180-182 (1977). Colony hybridization is carried out 
as generally described in M. Grunstein et ah, Proc. Natl Acad. Sci. USA., 
72:3961-3965 (1975). 

An alternative method combines the use of synthetic oligonucleotide 
primers with polymerase extension on an mRNA or DNA template. This 

5 polymerase chain reaction (PCR) method amplifies the nucleic acids encoding the 
protein of interest directly from mRNA, cDNA, genomic libraries or cDNA 
libraries. Restriction endonuclease sites can be incorporated into the primers. 
Polymerase chain reaction or other in vitro amplification methods may also be 
useful, for example, to clone nucleic acids encoding specific proteins and express 

0 said proteins, to synthesize nucleic acids that will be used as probes for detecting the 
presence of mRNA encoding cancer-associated proteins in physiological samples, 
for nucleic acid sequencing, or for other purposes. U.S. Patent Nos. 4,683,195 and 
4,683,202 describe this method. Genes amplified by a PCR reaction can be purified 
from agarose gels and cloned into an appropriate vector. 

5 Appropriate primers and probes for identifying the genes encoding 

cancer-associated proteins from mammalian tissues are generated from comparisons 
of the sequences provided herein. For a general overview of PCR, see PCR 
Protocols: A Guide to Methods and Applications. (Innis, M, Gelfand, D., Sninsky, J. 
and White, T., eds.), Academic Press, San Diego (1990), incorporated herein by 

0 reference. 

Synthetic oligonucleotides can be used to construct genes. This is 
done using a series of overlapping oligonucleotides, usually 40-120 bp in length, 
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representing both the sense and anti-sense strands of the gene. These DNA 
fragments are then annealed, ligated and cloned. 

A gene involved in the onset of cancer, for example, can be cloned 
using intermediate vectors before transformation into mammalian cells for 

5 expression. These intermediate vectors are typically prokaryote vectors or shuttle 
vectors. The proteins can be expressed in either prokaryotes, using standard 
methods well known to those of skill in the art, or eukaryotes as described infra. 

C. Expression in Eukaryotes 

Standard eukaryotic transfection methods are used to produce 

0 eukaryotic cell lines, e.g. , yeast, insect, or mammalian cell lines, which express large 
quantities of the cancer-associated proteins which are then purified using standard 
techniques. See, e.g., Colley et al, J. Biol Chem. 264:17619-17622, (1989), and 
Guide to Protein Purification, in Vol. 182 of Methods in Enzymology (Deutscher ed., 
1990), both of which are incorporated herein by reference. 

5 Transformations of eukaryotic cells are performed according to 

standard techniques as described by D.A. Morrison, J. Bad., 132:349-351 (1977), or 
by J.E. Clark-Curtiss and R. Curtiss, Methods in Enzymology, 101:347-362, Eds. R. 
Wu et. al. 9 Academic Press, New York (1983). 

Any of the well known procedures for introducing foreign nucleotide 

0 sequences into host cells may be used. These include the use of calcium phosphate 
transfection, polybrene, protoplast fusion, electroporation, liposomes, 
microinjection, plasma vectors, viral vectors and any of the other well known 
methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other 
foreign genetic material into a host cell (see Sambrook et al. 9 supra). It is only 

5 necessary that the particular genetic engineering procedure utilized be capable of 
successfully introducing at least one gene into the host cell which is capable of 
expressing the protein. 

The particular eukaryotic expression vector used to transport the 
genetic information into the cell is not particularly critical. Any of the conventional 

0 vectors used for expression in eukaryotic cells may be used. Expression vectors 

containing regulatory elements from eukaryotic viruses are typically used. SV40 

vectors include pSVT7 and pMT2. Vectors derived from bovine papilloma virus 
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include pBV-lMTHA, and vectors derived from Epstein Bar virus include pHEBO, 
and p205. Other exemplary vectors include pMSG, pAV009/A + , pMTO10/A + , 
pMAMneo-5, baculo virus pDSVE, and any other vector allowing expression of 
proteins under the direction of the S V-40 early promoter, SV-40 later promoter, 

5 metallothionein promoter, murine mammary tumor virus promoter, Rous sarcoma 
virus promoter, polyhedrin promoter, or other promoters shown effective for 
expression in eukaryotic cells. 

The vectors usually include selectable markers which result in gene 
amplification, such as thymidine kinase, aminoglycoside phosphotransferase, 

0 hygromycin B phosphotransferase, xanthine-guanine phosphoribosyl transferase, 
CAD (carbamyl phosphate synthetase, aspartate transcarbamylase, and 
dihydroorotase), adenosine deaminase, dihydrofolate reductase, and asparagine 
synthetase and ouabain selection. Alternatively, high yield expression systems not 
involving gene amplification are also suitable, such as using a baculovirus vector in 

5 insect cells, with a target protein encoding sequence under the direction of the 
polyhedrin promoter or other strong baculovirus promoters. 

The expression vector of the present invention will typically contain 
both prokaryotic sequences that facilitate the cloning of the vector in bacteria as well 
as one or more eukaryotic transcription units that are expressed only in eukaryotic 

0 cells, such as mammalian cells. The vector may or may not comprise a eukaryotic 
replicon. If a eukaryotic replicon is present, then the vector is amplifiable in 
eukaryotic cells using the appropriate selectable marker. If the vector does not 
comprise a eukaryotic replicon, no episomal amplification is possible. Instead, the 
transfected DNA integrates into the genome of the transfected cell, where the 

5 promoter directs expression of the desired gene. The expression vector is typically 
constructed from elements derived from different, well characterized viral or 
mammalian genes. For a general discussion of the expression of cloned genes in 
cultured mammalian cells, see, Sambrook et al., supra, Ch. 16. 

The prokaryotic elements that are typically included in the 

0 mammalian expression vector include a replicon that functions in E. coli, a gene 

encoding antibiotic resistance to permit selection of bacteria that harbor recombinant 
plasmids, and unique restriction sites in nonessential regions of the plasmid to allow 
insertion of eukaryotic sequences. The particular antibiotic resistance gene chosen 
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is not critical, any of the many resistance genes known in the art are suitable. The 
prokaryotic sequences are preferably chosen such that they do not interfere with the 
replication of the DNA in eukaryotic cells. 

The expression vector contains a eukaryotic transcription unit or 

5 expression cassette that contains all the elements required for the expression of the 
cancer-associated protein encoding DNA in eukaryotic cells. A typical expression 
cassette contains a promoter operably linked to the DNA sequence encoding the 
cancer-associated protein and signals required for efficient polyadenylation of the 
transcript. The DNA sequence encoding the protein may typically be linked to a 

0 cleavable signal peptide sequence to promote secretion of the encoded protein by the 
transformed cell. Such signal peptides would include, among others, the signal 
peptides from tissue plasminogen activator, insulin, and neuron growth factor, and 
juvenile hormone esterase of Heliothis virescens. Additional elements of the 
cassette may include enhancers and, if genomic DNA is used as the structural gene, 

5 introns with functional splice donor and acceptor sites. 

Eukaryotic promoters typically contain two types of recognition 
sequences, the TATA box and upstream promoter elements. The TATA box, 
located 25-30 base pairs upstream of the transcription initiation site, is thought to be 
involved in directing RNA polymerase to begin RNA synthesis. The other upstream 

0 promoter elements determine the rate at which transcription is initiated. 

Enhancer elements can stimulate transcription up to 1,000 fold from 
linked homologous or heterologous promoters. Enhancers are active when placed 
downstream or upstream from the transcription initiation site. Many enhancer 
elements derived from viruses have a broad host range and are active in a variety of 

5 tissues. For example, the SV40 early gene enhancer is suitable for many cell types. 
Other enhancer/promoter combinations that are suitable for the present invention 
include those derived from polyoma virus, human or murine cytomegalovirus, the 
long term repeat from various retroviruses such as murine leukemia virus, murine or 
Rous sarcoma virus and HIV. See, Enhancers and Eukaryotic Expression, Cold 

3 Spring Harbor Pres, Cold Spring Harbor, N.Y. 1983, which is incorporated herein 
by reference. 

In the construction of the expression cassette, the promoter is 
preferably positioned about the same distance from the heterologous transcription 
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start site as it is from the transcription start site in its natural setting. As is known in 
the art, however, some variation in this distance can be accommodated without loss 
of promoter function. 

In addition to a promoter sequence, the expression cassette should 

5 also contain a transcription termination region downstream of the structural gene to 
provide for efficient termination. The termination region may be obtained from the 
same gene as the promoter sequence or may be obtained from different genes. 

If the mRNA encoded by the structural gene is to be efficiently 
translated, polyadenylation sequences are also commonly added to the vector 

0 construct. Two distinct sequence elements are required for accurate and efficient 
polyadenylation: GU or U rich sequences located downstream from the 
polyadenylation site and a highly conserved sequence of six nucleotides, AAUAAA, 
located 1 1-30 nucleotides upstream. Termination and polyadenylation signals that 
are suitable for the present invention include those derived from SV40, or a partial 

5 genomic copy of a gene already resident on the expression vector. 

In addition to the elements already described, the expression vector of 
the present invention may typically contain other specialized elements intended to 
increase the level of expression of cloned genes or to facilitate the identification of 
cells that carry the transfected DNA. For instance, a number of animal viruses 

0 contain DNA sequences that promote the extra chromosomal replication of the viral 
genome in permissive cell types. Plasmids bearing these viral replicons are 
replicated episomally as long as the appropriate factors are provided by genes either 
carried on the plasmid or with the genome of the host cell. 

The cDNA encoding the protein of the invention can be ligated to 

5 various expression vectors for use in transforming host cell cultures. The vectors 
typically contain gene sequences to initiate transcription and translation of the 
cancer-associated gene. These sequences need to be compatible with the selected 
host cell. In addition, the vectors preferably contain a marker to provide a 
phenotypic trait for selection of transformed host cells such as dihydro folate 

3 reductase or metallothionein. Additionally, a vector might contain a replicative 
origin. 

Cells of mammalian origin are illustrative of cell cultures useful for 
the production of, for example, the cancer-associated protein. Mammalian cell 
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systems often will be in the form of monolayers of cells although mammalian cell 
suspensions may also be used. Illustrative examples of mammalian cell lines 
include VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, WI38, 
BHK, COS-7 or MDCK cell lines. NIH 3T3 or COS cells are preferred. 
5 As indicated above, the vector, e.g., a plasmid, which is used to 

transform the host cell, preferably contains DNA sequences to initiate transcription 
and sequences to control the translation of the cancer-associated protein gene 
sequence. These sequences are referred to as expression control sequences. 
Illustrative expression control sequences are obtained from the SV-40 promoter 

10 {Science, 222:524-527 (1983)), the CMV I.E. Promoter {Proc. Natl. Acad. Sci. 
81:659-663 (1984)) or the metallothionein promoter {Nature 296:39-42 (1982)). 
The cloning vector containing the expression control sequences is cleaved using 
restriction enzymes and adjusted in size as necessary or desirable and ligated with 
sequences encoding the cancer-associated protein by means well known in the art. 

15 When higher animal host cells are employed, polyadenylation or 

transcription terminator sequences from known mammalian genes need to be 
incorporated into the vector. An example of a terminator sequence is the 
polyadenylation sequence from the bovine growth hormone gene. Sequences for 
accurate splicing of the transcript may also be included. An example of a splicing 

20 sequence is the VP1 intron from SV40 (Sprague, J. et al. 9 J. Virol 45: 773-781, 
(1983)). 

Additionally, gene sequences to control replication in the host cell 
may be incorporated into the vector such as those found in bovine papilloma virus 
type-vectors. Saveria-Campo, M., "Bovine Papillomavirus DNA a Eukary otic 

25 Cloning Vector" in DNA Cloning Vol.11 a Practical Approach Ed. D.M. Glover, IRL 
Press, Arlington, Virginia pp. 213-238, (1985). 

The transformed cells are cultured by means well known in the art. 
For example, such means are published in Biochemical Methods in Cell Culture and 
Virology, Kuchler, R.J., Dowden, Hutchinson and Ross, Inc. (1977). The expressed 

30 protein is isolated from cells grown as suspensions or as monolayers. The latter are 
recovered by well known mechanical, chemical or enzymatic means. 
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VII, Purification of the Proteins for Use with the Invention 



After expression, the proteins of the present invention can be purified 
to substantial purity by standard techniques, including selective precipitation with 
substances as ammonium sulfate, column chromatography, immunopurification 
5 methods, and other methods known to those of skill in the art. See, for instance, R. 
Scopes, Protein Purification: Principles and Practice, Springer- Verlag: New York 
(1982), U.S. Patent No. 4,673,641, Ausubel, and Sambrook, incorporated herein by 
reference. 

A number of conventional procedures can be employed when a 
0 recombinant protein is being purified. For example, proteins having established 
molecular adhesion properties can be reversibly fused to the subject protein. With 
the appropriate ligand, the cancer-associated protein, for example, can be selectively 
adsorbed to a purification column and then freed from the column in a relatively 
pure form. The fused protein is then removed by enzymatic activity. Finally, 
5 cancer-associated protein can be purified using immunoaffinity columns. 

A. Purification of Proteins from Recombinant Bacteria 

When recombinant proteins are expressed by the transformed bacteria 
in large amounts, typically after promoter induction, although expression can be 
constitutive, the proteins may form insoluble aggregates. There are several 

0 protocols that are suitable for purification of protein inclusion bodies. For example, 
purification of aggregate proteins (hereinafter referred to as inclusion bodies) 
typically involves the extraction, separation and/or purification of inclusion bodies 
by disruption of bacterial cells typically, but not limited by, incubation in a buffer of 
about 100-150 j^g/ml lysozyme and 0.1% Nonidet P40, a non-ionic detergent. The 

5 cell suspension can be ground using a Polytron grinder (Brinkman Instruments, 
Westbury, N.Y.). Alternatively, the cells can be sonicated on ice. Alternate 
methods of lysing bacteria are described in Ausubel and Sambrook and will be 
apparent to those of skill in the art. 

The cell suspension is generally centrifuged and the pellet containing 

d the inclusion bodies resuspended in buffer which does not dissolve but washes the 
inclusion bodies, e.g., 20 mM Tris-HCl (pH 7.2), 1 mM EDTA, 150 mM NaCl and 
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2% Triton-X 100, a non-ionic detergent. It may be necessary to repeat the wash step 
to remove as much cellular debris as possible. The remaining pellet of inclusion 
bodies may be resuspended in an appropriate buffer (e.g., 20 mM sodium phosphate, 
pH 6.8, 150 mM NaCl). Other appropriate buffers will be apparent to those of skill 

5 in the art. 

Following the washing step, the inclusion bodies are solubilized by 
the addition of a solvent that is both a strong hydrogen acceptor and a strong 
hydrogen donor (or a combination of solvents each having one of these properties). 
The proteins that formed the inclusion bodies may then be renatured by dilution or 
10 dialysis with a compatible buffer. Suitable solvents include, but are not limited to, 
urea (from about 4 M to about 8 M), formamide (at least about 80%, volume/volume 
basis), and guanidine hydrochloride (from about 4 M to about 8 M). Some solvents 
which are capable of solubilizing aggregate- forming proteins, such as SDS (sodium 
dodecyl sulfate) and 70% formic acid, are inappropriate for use in this procedure due 
15 to the possibility of irreversible denaturation of the proteins, accompanied by a lack 
of immunogenicity and/or activity. Although guanidine hydrochloride and similar 
agents are denaturants, this denaturation is not irreversible and renaturation may 
occur upon removal (by dialysis, for example) or dilution of the denaturant, allowing 
re-formation of the immunologically and/or biologically active protein of interest. 
After solubilization, the protein can be separated from other bacterial proteins by 
standard separation techniques. 

Alternatively, it is possible to purify proteins from bacteria 
periplasm. Where the protein is exported into the periplasm of the bacteria, the 
periplasmic fraction of the bacteria can be isolated by cold osmotic shock in addition 
to other methods known to those of skill in the art (see, Ausubel, supra). To isolate 
recombinant proteins from the periplasm, the bacterial cells are centrifuged to form a 
pellet. The pellet is resuspended in a buffer containing 20% sucrose. To lyse the 
cells, the bacteria are centrifuged and the pellet is resuspended in ice-cold 5 mM 
MgS04 and kept in an ice bath for approximately 10 minutes. The cell suspension is 
centrifuged and the supernatant decanted and saved. The recombinant proteins 
present in the supernatant can be separated from the host proteins by standard 
separation techniques well known to those of skill in the art. 



50 



WO 01/30964 PCT/US00/29126 

B. Standard Protein Separation Techniques For Purifying 
Proteins 

1. Solubility Fractionation 

Often as an initial step, and if the protein mixture is complex, an 
5 initial salt fractionation can separate many of the unwanted host cell proteins (or 
proteins derived from the cell culture media) from the recombinant protein of 
interest. The preferred salt is ammonium sulfate. Ammonium sulfate precipitates 
proteins by effectively reducing the amount of water in the protein mixture. Proteins 
then precipitate on the basis of their solubility. The more hydrophobic a protein is, 

10 the more likely it is to precipitate at lower ammonium sulfate concentrations. A 
typical protocol is to add saturated ammonium sulfate to a protein solution so that 
the resultant ammonium sulfate concentration is between 20-30%. This will 
precipitate the most hydrophobic of proteins. The precipitate is discarded (unless 
the protein of interest is hydrophobic) and ammonium sulfate is added to the 

1 5 supernatant to a concentration known to precipitate the protein of interest. The 
precipitate is then solubilized in buffer and the excess salt removed if necessary, 
through either dialysis or diafiltration. Other methods that rely on solubility of 
proteins, such as cold ethanol precipitation, are well known to those of skill in the art 
and can be used to fractionate complex protein mixtures. 

20 2. Size Differential Filtration 

Based on a calculated molecular weight, a protein of greater and 
lesser size can be isolated using ultrafiltration through membranes of different pore 
sizes (for example, Amicon or Millipore membranes). As a first step, the protein 
mixture is ultrafiltered through a membrane with a pore size that has a lower 
25 molecular weight cut-off than the molecular weight of the protein of interest. The 
retentate of the ultrafiltration is then ultrafiltered against a membrane with a 
molecular cut off greater than the molecular weight of the protein of interest. The 
recombinant protein will pass through the membrane into the filtrate. The filtrate 
can then be chromatographed as described below. 
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3. Column Chromatography 

The proteins of interest can also be separated from other proteins on 
the basis of their size, net surface charge, hydrophobicity and affinity for ligands. In 
addition, antibodies raised against proteins can be conjugated to column matrices 
5 and the proteins immunopurified. All of these methods are well known in the art. 

It will be apparent to one of skill that chromatographic techniques can 
be performed at any scale and using equipment from many different manufacturers 
(e.g., Pharmacia Biotech). 



10 It is noted that many of the sequences described herein are publicly 

available in GenBank, which is the NIH genetic sequence database, an annotated 
collection of all publicly available DNA sequences (Nucleic Acids Research 1998 
Janl;26(l):l-7). 

All publications and patent applications cited in this specification are 
15 herein incorporated by reference as if each individual publication or patent 

application were specifically and individually indicated to be incorporated by 
reference. 

Although the foregoing invention has been described in some detail 
by way of illustration and example for purposes of clarity of understanding, it will 

20 be readily apparent to those of ordinary skill in the art in light of the teachings of this 
invention that certain changes and modifications may be made thereto without 
departing from the spirit or scope of the appended claims. 

Table 1 below indicates genes by identification in the "GeneName" 
column that demonstrate change in expression with cancer. "Image ClonelD" refers 

25 to the IMAGE Consortium library clone identification number. "DataBaselD" is the 
database that lists the gene to which the clone has the highest homology: GB 
(GenBank), SP (SwissProt), or UG (UniGene). "Accession Number" is the gene 
identification number in the database indicated in the "DataBaselD" listing. Where 
a tissue is indicated in the "3X higher in normal tissue than cancer" column, it 

30 means that the gene is expressed at higher levels in normal healthy cells vs. the 
corresponding cancerous tissues. The cancerous tissues at issue include colon 1, 
kidney 55, kidney 105, kidney 510, kidney 1010, kidney renal cell carcinoma, 
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breast, breast lobularcarcinoma, liver and/or prostate as designated on the Table. 
Where a tissue is indicated in the "3X higher in cancerous tissue than normal" 
column, it means that the expression of the subject gene is significantly increased in 
cancerous vs. the corresponding normal healthy tissues. For example, the Cysteine- 

5 Rich Protein 2 (LifeSpan Master ID 952) is expressed at significantly higher levels 
in normal healthy colon cells than in the cancerous colon 1 cells. 
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WHAT IS CLAIMED IS : 

1 LA method for diagnosing cancer of a tissue of interest in a 

2 patient, comprising detecting the overexpression or the underexpression of a cancer- 

3 associated molecule in the tissue of interest in the patient, where overexpression or 

4 underexpression of such molecule is indicative of a cancer according to Table 1 . 

1 2. The method of claim 1 wherein underexpression of the 

2 cancer-associated molecule is indicative of cancer, and wherein the cancer- 

3 associated molecule is underexpressed in the patient. 

1 3. The method of claim 1 wherein overexpression of the cancer- 

2 associated molecule is indicative of cancer, and wherein the cancer-associated 

3 molecule is overexpressed in the patient. 

1 4. The method of claim 1 comprising detecting an mRNA 

2 encoding the cancer-associated molecule. 

1 5. The method of claim 1 comprising detecting the cancer- 

2 associated molecule in an immunoassay. 

1 6. The method of claim 1 wherein the tissue of interest is a tissue 

2 selected from the group consisting of liver, breast, prostate, kidney and colon. 

1 7. A method for inhibiting cancer comprising introducing into a 

2 cell a cancer-associated molecule according to Table 1, wherein underexpression of 

3 the cancer-associated molecule is indicative of cancer. 

1 8. The method of claim 7 wherein said cancer-associated 

2 molecule is a nucleic acid encoding a cancer-associated protein. 

1 9. The method of claim 7 wherein said cancer-associated 

2 molecule is a protein. 

1 10. A method for arresting cancer comprising inhibiting a cancer- 

2 associated molecule according to Table 1, wherein overexpression of the cancer- 

3 associated molecule is indicative of cancer. 
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1 11. The method of claim 1 0 wherein said cancer-associated 

2 molecule is inhibited using an antisense polynucleotide. 

1 12. The method of claim 10 wherein said cancer-associated 

2 molecule is inhibited using an antibody that specifically binds to the cancer- 

3 associated protein. 

1 13. A method for identifying a modulator of cancer development 

2 in a cell, the method comprising: 

3 culturing the cell in the presence of the modulator to form a first cell 

4 culture; 

5 contacting RNA or cDNA from the first cell culture with a probe 

6 which comprises a polynucleotide sequence associated with cancer development, 

7 wherein the polynucleotide sequence is selected from the group consisting of 

8 sequences set out in Table 1 ; 

9 determining whether the amount of the probe which hybridizes to the 

10 RNA or cDNA from the first cell culture is increased or decreased relative to the 

1 1 amount of the probe which hybridizes to RNA or cDNA from a second cell culture 

12 grown in the absence of said modulator; and 

1 3 detecting the presence of a decreased proliferative potential, 

14 transformation and malignancy in the first cell culture relative to the second cell 

1 5 culture. 

1 14. The method of claim 13 wherein the first and second cell 

2 cultures are obtained from a liver, kidney, breast, colon or prostate cell. 

1 1 5. A method for modulating cancer development in a patient in 

2 need thereof, the method comprising administering to the patient a compound that 

3 modulates the malignant transformation of a cell. 

1 16. The method of claim 15, wherein the modulator increases or 

2 decreases the expression of a nucleic acid sequence set out in Table 1 . 
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